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Reality trumps rhetoric 


As the shock of Donald Trump’s election subsides, it is time for scientists and politicians alike to 
engage with core issues — from climate change and energy independence to social inequality. 


as president-elect Donald Trump is surely realizing by now. It is 

impossible to know what direction the United States will take 
under Trump's stewardship, not least because his campaign was incon- 
sistent, contradictory and so full of falsehood and evasion. People have 
not voted for his policies, because what are they? Americans have instead 
put their faith in an ideology, and rejected a political class and system 
that they feel, with some justification, has moved on without them. 

In an Editorial last month, this journal argued that Trump was 
unsuitable for office (see Nature 538, 289; 2016). His contrary approach 
to evidence, disrespect for those he disagrees with, and toxic attitudes 
to women and other groups have no place ina modern democracy. His 
election gives Trump the chance to prove the many people who shared 
that view mistaken. And that, we know for sure, is one thing Trump 
relishes. For those who opposed him, now is not the time to turn away 
from politics. There can be no normalizing or forgetting the malignant 
words and attitudes that Trump used on the campaign trail. But it is 
time to engage and to address the issues in a constructive manner. 


Ts is a huge difference between campaigning and governing, 


ANATION DIVIDED 

Many US citizens, including a fair number of scientists, might not like it, 
but Trump is a reflection of the United States today. He is a reminder of 
the deep schisms — economic as well as cultural — in American society 
and beyond. Academics, in particular, must break out of their cultural 
bubbles and work to understand the sentiments behind Trump’ rise. 
There are elements of his agenda, including his attention to the plight 
of many working-class citizens who have missed out on the economic 
gains of the past 25 years, that truly merit attention. We need to better 
understand the causes and consequences of inequality, including how 
technology and globalization are reshaping the economic landscape. 

In a World View on page 331, Daniel Sarewitz appeals to the prag- 
matist in Trump and suggests some science-based policies that he 
could support, if he wishes to make good on his promises to improve 
life in the American heartland. 

Opponents should look for opportunities to compromise, too. Both 
Trump and Hillary Clinton called for increased spending on infrastruc- 
ture, for instance, and this would be a good thing, if done wisely. As 
Barack Obama and Clinton said last week, the country — and indeed 
the world — needs Trump to succeed. But success in governance does 
not mean the implementation of a divisive agenda that caters to an 
angry minority, which is precisely what Trump’s campaign was built 
on. The Democratic party still wields considerable power in the Senate, 
and it must use that power to oppose Trump's extreme agenda. 

Much will depend on whom Trump appoints to key posts in the 
White House and the government. Trump’s selection of Reince 
Priebus, chairman of the Republican National Committee, as his chief 
of staff may represent a pragmatic effort to mend ties with mainstream 
Republicans. But given the cheers coming from white nationalists, it 


should come as no surprise that the ideologues will have their seat at 
the table as well. Trump has announced that his chief strategist will be 
none other than Stephen Bannon, who is executive chairman of the 
right-wing media site Breitbart News. 

Trump has also selected a prominent climate sceptic, Myron Ebell, 
to head the transition team at the Environmental Protection Agency. 
Ebell, who is a policy director at the Competitive Enterprise Institute, 

an advocacy group in Washington DC, has 


“Trump is no scientific training, and has surely been 
unlikely to get appointed only to take a wrecking ball to the 
everything he Obama administration's climate regulations. 
wants, evenwith Here again, researchers must remain engaged 
a Republican and oppose all efforts to ignore the science on 


this or any other issue. Nature will play its part. 

During his campaign, Trump advocated 
energy independence for the United States. But he has railed against 
subsidies for solar and wind power, and promised to tear up regula- 
tions that aim to trim the expansion and continuation of fossil-fuel use. 
The response to Trump will be shaped by how he adapts such firebrand 
messages to political and economic realities. Like President Obama, 
Trump is unlikely to get everything he wants, even with a Republican 
majority in both houses of Congress for at least the next two years. 
Indeed, less than a week after the election, he was already taking back 
some of his pledges. The “big, beautiful wall” he promised has been 
reduced to a fence in some areas. And after meeting Obama, Trump 
announced that he may seek to maintain parts of Obama’ signature 
health-care reform. 

Trump should drop his pantomime-villain act on climate change. 
If he does not, then, come January, he will be the only world leader 
who fails to acknowledge the threat for what it is: urgent, serious and 
demanding of mature and reasoned debate and action. 

The world has made its decision on climate change. Action is too 
slow and too weak, but momentum is building. Opportunities and 
fortunes are being made. Trump the businessman must realize that 
the logical response is not to cry hoax and turn his back. The politician 
in Trump should do what he promised: reject political orthodoxy and 
listen to the US people. His Republican Party’s position on climate 
change is rooted in the past. A March Gallup poll found that public 
concern about the problem had reached an eight-year high: some 64% 
of Americans, it suggested, worried a “great deal” or a “fair amount” 
about global warming. That is a lot more than the percentage who 
voted for President Trump. 

There are nine more weeks until Trump swears to faithfully execute 
the office of president. As he does so, the world can only hope that 
Trump will respect evidence and expert advice. He has proved that he 
can stand up to withering criticism, from Democrats and Republicans 
alike. But can he change his mind in the face of hard facts? This is a true 
test for any leader, and Trump’ legacy may well ride on the answer. = 


majority.” 
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Maths for all 


Explaining concepts to the ‘manin the street’ 
isa worthy goal, but is not always feasible. 


brilliant and the most influential mathematician of the 

twentieth century. His reach continues today. Among other 
things, he popularized a common concept in the communication of 
science: the ‘man in the street’, whose understanding (or not) of a 
problem is commonly used as a benchmark for intelligibility. 

At the 1900 International Congress of Mathematicians in Paris, 
Hilbert set out to list the most important open problems of the field 
for the new century. But he also emphasized communication. “A 
mathematical theory is not to be considered complete,’ he said, “until 
you have made it so clear that you can explain it to the first man 
whom you meet on the street.” Hilbert attributed the saying to “an 
old French mathematician”. 

That is unusually imprecise for a mathematician. So, to complete the 
theory, two intrepid maths historians set out to identify Hilbert's elderly 
Gallic source. As they report in this month’s Historia Mathematica, 
they pursued a long paper trail, including a nineteenth-century letter 
published in Nature, and eventually succeeded (J. Barrow-Green and 
R. Siegmund-Schultze Hist. Math. 43, 415-426; 2016). 

It was indeed a Frenchman, one Joseph Diaz Gergonne (1771- 
1859), who first referred to the man in the street. In a letter dated 1825, 
Gergonne wrote that one has not said the last word on a theory until 
one has been able to explain it to a passant dans la rue — French for 
‘passer-by in the street’. A year later, in a second letter, he went further. 
A formula or method that could not be explained to a passing stranger 
“does not deserve to see the light of day”. 

That’s a big ask, because the man (or woman) in the street sets 
the bar high. As most readers of Nature will know well, explaining 
your research to a non-expert is often challenging. Certainly, Hilbert 
acknowledged that the goal was an exaggeration. A more realistic 


D avid Hilbert was extremely absent-minded, extraordinarily 


one might be that attributed to Albert Einstein, who said (perhaps 
apocryphally) that things should be made as simple as possible, but 
no simpler. 

That tension must have been on the mind of Nobel prize committee 
member Thors Hans Hansson in October. Ata press conference, he was 
tasked with explaining the motivation for awarding this year’s physics 
prize to David Thouless, Duncan Haldane and Michael Kosterlitz, who 
brought topology into physics in unexpected ways. In an attempt to 
illustrate the concepts involved, Hansson produced a bagel, a pretzel 

and a cinnamon bun from a bag. 


“The man ( or Topology is a relatively young branch of 
woman) in the maths: when Hilbert made his list of open 
street sets the problems, he included few problems from 


bar high.” topology. The field studies continuous defor- 
mations. The explanation goes that bagels 
and pretzels have different topologies because turning the former 
into the latter involves tearing the dough apart — not a continuous 
deformation. 

There is no word yet from the street on whether Hansson was 
successful in his explanation. But Gergonne would surely have 
approved: he studied projective geometry, which involves shapes with 
exotic topologies, such as surfaces that have only one side. He would 
also, surely, be astonished that his 1825 passer-by continues to pass 
by the labs and lecture rooms of the twenty-first century, ever-ready 
to test his or her comprehension. 

As for Einstein, Hilbert at one point raced him to the formulation 
of the general theory of relativity, only to later graciously concede. 
Hilbert is also credited with the dubious assertion that “Every boy in 
the streets of Géttingen understands more about four-dimensional 
geometry than Einstein’, which seems to be taking the principle of 
the passer-by as arbiter a little too far. (In fairness, he then went on to 
admit: “Yet, in spite of that, Einstein did the work, and not the math- 
ematicians.”) 

The new historical analysis solves one of the questions posed by 
Hilbert’s famous speech, but plenty remain. The celebrated 23 Hilbert 
problems inspired researchers for decades; some remain unsolved. 
Luckily, the passer-by can wait. When a solution appears, he or she 
will surely be ready. m 


Home from home 


We cannot look to the stars to solve our 
planet’s problems. 


orld events this year have been thought-provoking. One 
Wiens present in many minds might be: how are the plans 
for a Moon base or a Mars colony coming along? So, some 
much needed good news! US engineers have published their design for 
a nuclear reactor that could power a permanent off-planet settlement 
for 15 years (K. J. Schillo et al. Ann. Nucl. Energy 96, 307-312; 2016). 
In 2009 (and how long ago that seems now), NASA identified such 
a reactor design as essential if astronauts were to visit and spend time 
on Mars. It runs (but these are mere details) on low-enriched uranium 
ceramic-metal fuel and uses supercritical carbon dioxide — cleverly 
extracted from the Martian atmosphere — asa coolant. With a reli- 
able source of electricity, a Mars colony could then apply itself to what 
NASA calls in-situ resource utilization, and what everybody who saw 
the 2015 film The Martian will think of as growing potatoes in poo. 
Dont pack your bags for this off-world utopia just yet. For what kind 
of society will such an isolated outpost create? A series of papers in the 
journal Space Policy speculate on human affairs on the red planet, and 
reach some depressingly familiar conclusions. The costs of getting to 
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and living on the planet suggest the need for heavy corporate involve- 
ment, which could establish a conflict between those who want Mars 
to remain, well, Martian, and those who would develop and exploit its 
resources. Explorers must be able to plant a flag and claim territory 
(contrary to existing space law) to justify the trip — but, without restric- 
tions, a free-for-all on the Mars landscape could ruin it for everyone. 

The offered solution, naturally, is to divide Mars into exclusive 
economic zones, separated by a string of protected planetary parks. 
These would be safeguarded, naturally, by rules: no littering with space- 
craft parts, and no walking or driving except on designated tracks. 

Rules create a problem for colonists: whose interests do they serve, 
and who gets to decide them? A central Martian authority is a non- 
starter — attempts to impose a lunar government have stymied 
progress on a parallel Moon treaty — so some kind of tribunal system 
will be necessary, with appointments no doubt squabbled over by exist- 
ing and new powers. Finally, there is the question of what happens 
should the colonists decide that they are sick of being told what to do by 
their parent planet. How should Earth respond to a Martian rebellion 
or conflict? Would we intervene? Would we take refugees? 

One ‘pragmatic’ answer to all this likely division, researchers 
suggest, is the deliberate development ofa new Mars religion, especially 
for those born on Mars. This could emphasize the sense and purpose of 
the mission, and help to justify the difficult living conditions. 

The picture of Mars painted by these discussions, in other words, is a 
planet divided by politics, culture, religion, economics and inequality. 
Sound like anywhere you know? = 
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MICHAEL TEMCHINE 


WORLD VIEW .pecnisicor sen 


in 2008 and 2012, have now sent Donald Trump to the White 

House. It’s a shocking reversal that testifies to disaffection 
on many fronts. But it’s also a long-term political consequence of 
bipartisan trade, economic and innovation policies that have focused 
on growth and competitiveness while largely neglecting the negative 
impacts of technological change and globalization on quality of life in 
the American heartland. 

Trump built his campaign on the promise to help a middle class 
struggling from such dislocations to regain its footing. If he is serious, 
he will need to mobilize smart and creative science and innovation 
policies. Here are some. 

One rare bipartisan initiative of the past few years has been Manu- 
facturing USA, a network of nine regional centres 
focused on specific technological challenges such 
as 3D printing, photonics and functional fabrics. 
Each centre brings together private, government 
and university partners to help small firms to 
scale up and commercialize new technologies. 

The Trump administration should signifi- 
cantly build on this programme and integrate it 
with the Manufacturing Extension Partnership, 
which provides small manufacturers with techni- 
cal assistance. The overall goal should be regional 
innovation networks that draw from local univer- 
sities, talent and resources to build new indus- 
tries, expand manufacturing and create jobs. 

Initially, these and other programmes could 
be funded by Trump’s plan to repatriate profits, 
amounting to up to US$2 trillion, that US cor- 
porations have moved offshore. Trump wants 
to bring the money back to the United States by offering a one-time 
repatriation tax rate of 10% as an incentive. The resulting $200 billion or 
so of revenue is the equivalent of nearly three years of total non-defence 
government research and development spending. 

Repatriation of offshore pharmaceutical-company profits alone 
could generate $40 billion of revenue. This money could be spent on 
high-priority health-care problems relevant to Trump's agenda. First, 
an organization similar to the Defense Advanced Research Projects 
Agency is needed to bring academic, private-sector and government 
research and development capacities together to pursue high-risk, 
high-gain solutions to health problems such as obesity, diabetes and 
substance abuse, which are especially acute in economically depressed 
areas of the nation. Second, a national database should be created to 
integrate biomedical research data with electronic health records. Such 
a system is a prerequisite for a ‘learning’ health-care system that can use 
the huge amount of information generated both at the bench and in the 
clinic to deliver better health outcomes. Only government can do this. 

Trump is publicly sceptical about the seriousness of climate change, 


A merica’s ‘rust belt’ states, which gave Barack Obama their votes 


TRUMP COULD 


REIGNITE 


EFFORTS TO DEVELOP 


COAL PLANTS 


THAT CO-FIRE 


BIOMASS 


AND CAPTURE 
CARBON DIOXIDE. 


Science and innovation 
policies for Donald Trump 


Daniel Sarewitz says the new US president will need to promote creative 
science to benefit the disaffected millions who voted for him. 


and is committed to the coal industry. Yet climate change offers him the 
opportunity to steal one of the Democrats’ key issues while advancing 
the nation’s economic interests and energy security. As a supporter of 
nuclear energy, he could increase funding for research, development 
and demonstration of next-generation reactors — on which the United 
States lags far behind China and Russia — and reopen the licensing 
process for the Yucca Mountain waste-storage facility. 

Meanwhile, Trump could reignite efforts to develop coal plants that 
co-fire biomass and capture carbon dioxide — thus protecting jobs, 
creating economic opportunities for rural communities that provide fuels 
such as timber residues, and producing meaningful emissions reduc- 
tions. Power companies could in turn sell desperately needed CO, to 
the oil industry for advanced oil recovery — stimulating private-sector 
investment in infrastructure for CO, transport, 
while returning more CO, to the ground. Trump 
would barely have to soften his stance — just 
enough to acknowledge the pragmatism in meas- 
ures that both benefit the economy and reduce the 
threat posed by greenhouse gases. 

Once committed to an increasingly clean 
electricity grid, a Trump government could 
further invest in a national network of electric- 
vehicle recharging stations — thus solidifying the 
nation’s competitive advantage in this key tech- 
nology, and taking a giant step towards the elusive 
goal ofa low-carbon transport infrastructure. 

There are other ways for the anti-regulatory 
president-elect to create growth while protect- 
ing the environment. The US chemicals indus- 
try accounts for more than 10% of the nation’s 
manufacturing revenue and nearly 800,000 jobs 
ina highly competitive global market. A Trump regime could adopt 
the proven model of the Massachusetts Toxics Use Reduction Act, in 
which toxic chemicals are not directly regulated and banned, but firms 
work with government and academia towards assessing and adopting 
safer alternative chemicals and processes. Such an approach would 
help to protect the health and jobs of the manufacturing workers and 
communities Trump is concerned about, while stimulating innovation. 

There is much — much — to worry about with Trump’s election, 
starting with his apparent intolerance of fundamental tenets of civil soci- 
ety, such as rule of law and freedom of speech. But it’s also true that, as a 
political outsider, he is less beholden to constituencies that would limit 
the political options of more-mainstream candidates on both sides. As 
such, his presidency at least has the potential to open up productive new 
policy pathways. We will soon find out ifhe is serious about doing so. m 


Daniel Sarewitz is co-director of the Consortium for Science, Policy and 
Outcomes at Arizona State University, and is based in Washington DC. 
e-mail: daniel.sarewitz@asu.edu 
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RESEARCH HIGHLIGH 


PHOTONICS 


Nanolasers for 
precision imaging 


Researchers have turned 
nanowires into tiny lasers that 
could one day be used in high- 
resolution microscopy. 

The resolution of optical 
microscopes is normally 
constrained by the wavelength 
of light. To overcome this 
‘diffraction limit, a team 
led by Seok Hyun Yun at 
Massachusetts General 
Hospital in Cambridge 
made nanowires of lead 
iodide perovskite that were 
3-7 micrometres long and 
just 300-500 nanometres 
wide. When the team scanned 
the nanoparticles with a 
laser, they emitted light. The 
narrow width of the nanowires 
allowed their positions to 
be tracked with resolutions 
roughly five times better 
than the diffraction-limited 
resolution. 

Such nanolasers could one 
day be placed inside cells and 
yield super-resolution images 
from deep within tissue 
samples, the authors say. 

Phys. Rev. Lett. 117, 193902 
(2016) 


Plants take up 
more carbon 


Enhanced plant growth over 
the past decade seems to have 
slowed the build-up of carbon 
dioxide in the atmosphere. 
Despite the rise in CO, 
emissions resulting from 
human activity, atmospheric 
CO, levels have grown 
relatively little since 2002. 
To find out why, Trevor 
Keenan at Lawrence Berkeley 
National Laboratory in 
California and his colleagues 
used ground observations, 
satellite data and vegetation 
models to quantify changes 


Selections from the 
scientific literature 


ANIMAL BEHAVIOUR 


Plastic smells good to marine birds 


Marine animals often mistake plastic pollution 
for food, possibly because of its smell. 

Marine grazers such as krill consume 
microscopic organisms called phytoplankton, 
which release dimethyl sulfide (DMS). Some 
of krill’s seabird predators, such as petrels and 
shearwaters, sniff out this chemical to find the 
grazers. Matthew Savoca and Gabrielle Nevitt 
at the University of California, Davis, and their 
colleagues attached beads of the three most 


in CO, uptake and release by 
terrestrial plants worldwide. 
They found that increased 
photosynthesis and plant 
‘greening’ have boosted the 
amount of carbon stored 
on land. Reduced plant 
respiration due to the recent 
slowdown in the rate of global 
warming also seems to have 
increased this carbon sink. 
However, terrestrial carbon 
stocks will not offset the 
accumulation of heat-trapping 
gases in the atmosphere if 
emissions continue to grow, 
the scientists caution. 
Nature Commun. 7, 13428 (2016) 
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CANCER IMMUNOTHERAPY 


Gene-edited cells 
fight cancer 


Immune cells engineered to 
target cancer could provide an 
off-the-shelf therapy if results 
in mice can be replicated in 
people. 

Previous work has shown 
that engineered T cells called 
CAR-T cells can be taken from 
a patient and modified to kill 
some types of tumour, but the 
supply can be limited. To make 
a more reliable, universal 
source of donor cells, Carl 


common types of ocean plastic to buoys off the 
coast of California for three weeks. They detected 
DMS emitted from every seawater-exposed 
sample, but not from unexposed plastic. 

Data from other studies showed that predators 
that were most responsive to DMS also ingested 
the most plastic, which can poison animals and 
block their digestive systems (pictured is an 
albatross with plastic in its stomach). 

Sci. Adv. 2,e1600395 (2016) 


June, Yangbing Zhao and their 
colleagues at the University of 
Pennsylvania in Philadelphia 
used the genome-editing 

tool CRISPR-Cas9 to tweak 
human CAR-T cells. They 
disabled two proteins that 

can trigger immune rejection 
of donor T cells, resulting in 

a reduced reaction from the 
immune system when the cells 
were used in mice. 

The team also disrupted a 
protein called PD-1, which 
normally holds immune 
responses in check. The cells 
slowed the growth of human 
tumours implanted into mice. 


PAUL & PAVEENA MCKENZIE/GETTY 


CATHERINE CLARK 


S. F. CABREIRA ET AL./CURR. BIOL. 


Further testing of immune 
responses should be carried 
out in non-human primates, 
the authors say. 

Clin. Cancer Res. http://doi.org/ 
bsve (2016) 


| NEUROSCIENCE 
Brain circuit for 
tickling found 


Researchers have pinpointed 
a brain circuit in rats that fires 
when the animals are tickled. 
Rats are known to chirp or 
‘laugh when tickled. To find 
the neural mechanism, Michael 
Brecht and Shimpei Ishiyama 
at the Humboldt University 
of Berlin inserted electrodes 
into the somatosensory area of 
the rat brain, which responds 
to touch on the skin. Certain 
neurons in this region fired 
intensely when the rats were 
tickled on their bellies, but 
responded less to back-tickling 
and hardly at all to tail-tickling. 
By stimulating these 
neurons with the electrodes, 
the researchers caused the rats 
to chirp in the same way they 
do when tickled. 
Science 354, 756-759 (2016) 


CARDIOVASCULAR BIOLOGY 


Food chemical 
protects the heart 


A compound found naturally 
in legumes, grains and mature 
cheese helps rodents to 
maintain a healthy heart. 

The chemical, spermidine, 
extends the lifespans of yeast, 
flies and worms by promoting 
autophagy, a cellular process 
that degrades and recycles 
old components of the cell 
and toxic products. Frank 
Madeo at the University 
of Graz in Austria and his 
collaborators found that aged 
mice that regularly drank 
water containing spermidine 
had improved heart function 
compared to untreated mice 
of the same age, and that the 
compound lowered blood 
pressure in rats prone to 
developing heart failure. 
Spermidine reversed various 
age-related mechanical and 
metabolic changes in the heart 


cells of treated animals. 

The chemical seemed 
to exert its effects through 
autophagy, because it did not 
benefit mice with a genetic 
defect that impairs this process 
in heart-muscle cells. 
Nature Med. http://dx.doi. 
org/10.1038/nm.4222 (2016) 


Elastic polymer 
mimics muscles 


A lightweight elastomer can 
stretch and contract when 
stimulated by low voltages, 
making it a promising material 
for artificial muscles. 

Dielectric elastomers 
exhibit reversible physical 
deformations when stimulated 
by electricity, but previous 
attempts to make artificial 
muscles from such materials 
required cumbersome 
braces to prevent rupture 
during long stretches. To 
avoid using braces, Andrey 
Dobrynin at the University 
of Akron in Ohio, Sergei 
Sheiko at the University of 
North Carolina at Chapel Hill 
and their team developed a 
dielectric elastomer by linking 
together polymer strands in 
a bottlebrush-like structure. 
This resulted in a stronger and 
more stretchable material with 
a lower risk of rupture than 
current materials. 

The synthetic elastomers 
could eventually have 
applications in fields including 
soft robotics, the authors say. 
Adv. Mater. http://doi.org/f3s534 
(2016) 


Red squirrels 
get leprosy 


Two species of bacteria that 
cause leprosy have been found 
in red squirrels in Britain 

and Ireland — a surprising 
discovery because only 


RESEARCH HIGHLIGHTS BiiiSaiiaa¢ 


primates and armadillos were 
thought to get the disease. 

Stewart Cole at the Swiss 
Federal Institute of Technology 
in Lausanne, Anna Meredith 
at the University of Edinburgh, 
UK, and their team analysed 
tissue samples from 110 dead 
red squirrels (Sciurus vulgaris). 
Thirteen showed characteristic 
signs of infection such as 
skin lesions and swelling of 
the snout. The authors found 
leprosy-causing bacteria — 
Mycobacterium leprae or 
Mycobacterium lepromatosis 
— in all visibly ill animals 
(squirrel with leprosy on ear 
and snout pictured) and in 
about one-fifth of symptom- 
free ones. 

This could represent 
another threat to red squirrels 
in the United Kingdom and 
Ireland, which are already 
affected by other infections 
and by competition from grey 
squirrels introduced from 
North America. 

Science 354, 743-746 (2016) 


Early dinosaurs 
emerged gradually 


Dinosaurs and their precursors 
lived side by side in the earliest 
stages of dinosaur evolution. 
Max Langer at the 
University of Sao Paulo in 
Brazil and his colleagues 
explored some of the oldest- 
known rocks that contain 
dinosaur fossils, aged between 
237 million and 228 million 
years old. The team unearthed 
anew species of an early 
‘lizard-hipped’ dinosaur 
(Buriolestes schultzi; pictured 
right), which co-existed with 
anon-dinosaur relative, 
Ixalerpeton polesinensis (left). 
The non-dinosaur had skull 
and bone characteristics 
that resemble those of later 
dinosaurs, showing where key 
parts of dinosaur anatomy 
came from. The dinosaur’s 


teeth suggested that it ate small 
animals, unlike later species in 
its class that fed only on plants. 

The findings suggest that 
dinosaurs did not rapidly 
replace their predecessors 
during their early evolution. 
Curr. Biol. http://doi.org/bsww 
(2016) 


Old DNA reveals 
fishing habits 


Analysing ancient DNA from 
the tropics is difficult because 
DNA breaks down fairly 
rapidly in heat, but a team has 
managed to tease out some 
of the species represented by 
hundreds of 100-300-year-old 
fish bones from Madagascar. 
Short ‘barcode’ DNA 
sequences from a region of 
the genome that tends to vary 
between taxonomic groups can 
be used to identify fragmented 
animal bones discovered at 
archaeological sites. A team 
led by Alicia Grealy at Curtin 
University in Perth, Australia, 
applied this method to 887 
fish bones from a site on the 
west coast of Madagascar that 
was occupied by humans in 
the eighteenth and nineteenth 
centuries. The authors were 
able to identify 23 fish families 
— 10 of which could be 
narrowed down, with some 
confidence, to a specific species 
— including two kinds of shark 
and various coral-reef-dwelling 
fish. 
J. Archaeol. Sci. 75, 82-88 (2016) 
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SEVEN DAYS nescnisn 


EVENTS 


Bird flu in Europe 
A highly pathogenic strain of 
avian influenza virus called 
H5N8 has been identified in 
several European countries, 
threatening the region’s poultry 
industry, worth €30 billion 
(US$32 billion) a year. On 

12 November, authorities 
ordered the slaughter of 
30,000 chickens ona farm 

in northern Germany as 

a containment measure. 
Scientists believe that the 
strain was imported from Asia 
by migrating birds. The last 
epidemic of H5N8 in Europe 
was in November 2014; more 
than 336,000 birds were culled. 
H5N8 is not known to infect 
humans. 


Computing cancer 
A genomics powerhouse will 
team up with a supercomputer 
to unpick the tangled cellular 
networks that can cause 
resistance to cancer drugs. 

On 10 November, the Broad 
Institute of MIT and Harvard 
in Cambridge, Massachusetts, 
and IBM Watson Health 
announced a five-year, 
US$50-million study that 

uses IBM’s Watson artificial- 
intelligence system to analyse 
the genomes of thousands of 
drug-resistant tumours. The 
goal is to find patterns that 
could allow clinicians to predict 
drug sensitivity and resistance. 
The data are to be shared with 
the scientific community. 


Stem cells on offer 
Public prosecutors in Italy 

are investigating whether 
disgraced entrepreneur Davide 
Vannoni — who was convicted 
on criminal charges last year 
for administering unproven 
stem-cell therapies — is 
offering his procedures again. 
The terms of his conviction 
required him to refrain from 
performing procedures in 
Italy, and possibly from doing 


Carbon dioxide emissions stay stable 


The world’s carbon dioxide emissions have 
remained almost stable for the third year in 

a row, despite continuing economic growth. 
Global CO, emissions from fossil-fuel burning 
(pictured is a coal-fired power station) are set to 
total 36.4 billion tonnes this year, only marginally 
more than in 2015 and 2014, according to the 
annual Global Carbon Budget analysis released 


so abroad. However, patient 
groups on social media posted 
comments suggesting that 

the procedure is available 

in Georgia. The technique 
involves modifying stem cells 
from the bone marrow ofa 
patient or relative and injecting 
them back into the patient. 

In October, a patient came 
forward with a detailed account 
of his treatment, prompting 
prosecutors to take action. See 
page 340 for more. 


Open peer review 
A trial by Nature 
Communications shows that 
many scientists are willing to 
try open peer review. Around 
60% of the journal's authors 
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so far this year have agreed to 
have their reviews published, 
so the journal will continue to 
offer the option — but will not 
make it mandatory, it said on 
10 November. See page 343 for 
more. 


Wellcome boost 

The Wellcome Trust in 
London, one of the world’s 
largest medical charities, 
launched its open-access 
publishing platform on 

15 November. Wellcome Open 
Research allows Wellcome 
grant recipients to publish a 
variety of findings — including 
conventional articles, software, 
study protocols and living 

data sets — within days of 
submission. Publication is 
followed by transparent peer 


on 14 November. The flattening is largely the 
result of a drop in coal consumption in the 
United States and China, the world’s largest 
greenhouse-gas emitters. If warming is to be 
kept below 2°C, global CO, emissions must soon 
start to decline by around 3% per year, scientists 
reminded the United Nations climate meeting 
now under way in Marrakesh, Morocco. 


review in which authors choose 
their referees. The funder hopes 
that its initiative will change 

the way research is assessed, 
and that other grant-givers will 
follow its model. 


| RESEARCH 
LIGO restart 


After a ten-month shutdown 
for upgrades, the Laser 
Interferometer Gravitational- 
wave Observatory (LIGO) 

is ready to make discoveries 
again. In February, researchers 
stunned the world when they 
announced the detection of 
gravitational waves from a 
collision between distant black 
holes, and a second detection 
was confirmed in June. 

LIGO’s two 8-kilometre-long 


PATRICK PLEUL/AFP/GETTY 


ESA/DENMAN 


SOURCE: EUROPE INVEST AND MONEYTREE 


interferometers, one in 
Louisiana and the other in 
Washington state, began a 
new engineering run’ on 

14 November and expect to 
start science runs in December. 
Upgrades to the detectors 
should enable the observatory 
to scan a significantly larger 
volume of the Universe than 
before. 


PS SPACE 
Space debris 


A rocket stage from a Chinese 
launch apparently crashed. 
near ajade mine in northern 
Myanmar on 10 November. 
That morning, a Long 

March 11 rocket lifted off from 
the Jiuquan launch centre in 
Inner Mongolia; minutes later, 
it would have passed over 
northern Myanmar. Local 
media reported the sound of 
an explosion and a stench in 
the air when the object fell. 
The rocket successfully put five 
satellites into orbit, including 
one meant to test celestial 
navigation using X-rays 
emitted by pulsars. 


Beagle 2 worked 
Beagle 2 (pictured, artist's 
impression), which lost 
touch with Earth as it landed 
on Mars on Christmas Day 
2003, came extremely close 
to success, mission scientists 
announced on 11 November. 
By comparing images of the 
lander taken by NASAs Mars 


TREND WATCH 


Europe’ research commissioner, 
Carlos Moedas, has launched an 


effort to boost venture capital 
(VC). Last year, investors put 


€5.3 billion (then US$5.9 billion) 


into firms in Europe; US 


investment was $60.1 billion. To 
help close the gap, Moedas said 
on 8 November that the European 


Commission would offer €400 
million for new independently 


managed VC funds, provided they 


raised triple that amount from 


private investors. The funds would 
go to other VC funds, rather than 


company stocks. 


Reconnaissance Orbiter with 
simulations of how sunlight 
would reflect off the probe in 
various configurations, they 
found that Beagle 2 probably 
opened at least three of its 
four solar panels. The British- 
built lander, operated by the 
European Space Agency, may 
even have collected data, but 
the last panel's failure to fully 
open would have blocked 
transmission from the craft’s 
antenna, says the UK team 
behind the analysis, which is 
not yet published. 


President Trump 


The election of Republican 
businessman and reality- 
television star Donald Trump 
as the next president of the 
United States on 8 November 
sent shock waves around 

the world. Trump's views on 
women and minority groups, 
his stance on climate change 
and nuclear weapons, and his 
contradictory approach to 
evidence have worried many 


— including scientists. During 
his successful campaign 

to defeat the Democratic 
candidate Hillary Clinton, 
Trump pledged to remove 

the United States from the 

2015 Paris climate-change 
agreement and to dismantle the 
deal between six world powers 
and Iran to limit that country’s 
nuclear capabilities. See pages 
329, 331 and 337 for more. 


State initiatives 


Americans voted ona host 

of state-level initiatives on 

8 November, some science- 
related. Voters in Washington 
state rejected a carbon tax, with 
its backers saying they will 
bring a reworked proposal to 
the Washington legislature next 
year. The people of Montana 
rejected a plan that would have 
awarded up to US$200 million 
in grants to biomedical 
scientists over 10 years. And 
voters in southern Florida were 
split on whether to backa plan 
to release genetically modified 
mosquitoes to fight diseases 
such as the Zika virus. It was 


EUROPE’S VENTURE-FUNDING GAP 


Europe’s venture-capital investment severely lags 


behind that in the United States. 
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SEVEN DAYS | THIS WEEK | 


17 NOVEMBER 
Europe’ next four 
Galileo navigation 
satellites are scheduled 
to launch on an Ariane 5 
rocket from French 
Guiana, bringing the 
number of satellites in 
the constellation to 18. 
go.nature.com/2fmfxfs 


19 NOVEMBER 
The United States 
launches GOES-R, its 
first next-generation 
geostationary weather 
satellite, from Cape 
Canaveral, Florida. 
WWW.g0eS-F.g0V 


23 NOVEMBER 

The UK government 
makes its first major 
financial statement since 
the vote to leave the 
European Union in June. 


rejected by the town of Key 
Haven, where the field trial 
would take place, but voters in 
surrounding Monroe County 
approved it. Ultimate approval 
rests with the Florida Keys 
Mosquito Control District, 
which has yet to decide on the 
matter. 


Science adviser 


Canada will issue an open 

call for nominations for the 
position of national science 
adviser, its science minister 
announced on 10 November. 
The Canadian public, 
researchers and institutions will 
be asked to suggest candidates 
for the post, which Liberal 
Prime Minister Justin Trudeau 
promised to bring back after 
being elected in 2015. The post 
had been scrapped in 2008 by 
his Conservative predecessor, 
Stephen Harper. Trudeau has 
also vowed to usher in an era of 
evidence-based policymaking. 
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Italian stem-cell 
entrepreneur under fresh 
investigation p.340 


themed icons coming 
to aphone near you p.341 


The science- 


story p.342 


Declines in foreign 
aid threaten science success 


How much longer will 
the Southern Ocean delay 
global warming? p.346 


US president-elect Donald Trump (right) surveys the National Mall in Washington DC. 


The Trump experiment 


Researchers struggle to predict US president-elect’s impact on science. 


BY SARA REARDON, JEFF TOLLEFSON, 
ALEXANDRA WITZE & LAUREN MORELLO 


r | he long campaign for the White House 
is over — but incoming US president 
Donald Trump’s work is just starting. 

With about two months to go before his inau- 

guration on 20 January, he and his staff are 

busy vetting candidates for top government 
jobs and clarifying his agenda for governing. 
Some scientists have expressed fear about 
how Trump’s presidency will affect research 
in the United States. The president-elect has 
questioned the science underlying climate 
change and has linked autism to childhood 


vaccinations; the vice-president-elect, Indi- 
ana governor Mike Pence, does not believe in 
evolution or that human activities have caused 
climate change. Still, some science advocates 
caution against a rush to judgement about how 
the Trump administration will approach sci- 
ence and research issues. 

“The verdict remains out,’ says Tobin Smith, 
vice-president for policy at the Association of 
American Universities in Washington DC. 
“There are many people who have been strong 
supporters of science you might not have been 
expecting.” Smith says that a prime example is 
Newt Gingrich, the former Republican con- 
gressman for Georgia, who is rumoured to be 


up for a top job in Trump’s administration. 

As speaker of the US House of Representa- 
tives in the 1990s, Gingrich supported a plan to 
double the National Institutes of Health (NIH) 
budget over ten years. Since leaving Congress, 
he has advocated significant spending hikes 
for the National Science Foundation and other 
science-funding agencies. 

But it’s hard to draw any conclusions about 
Trump’s views on science, given his limited 
comments on such issues, other policy special- 
ists caution. “He speaks positively of innova- 
tion, but ‘innovation is a big word,” says Kevin 
Wilson, director of public policy and media 
relations at the American Society for Cell 
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> Biology in Bethesda, Maryland. “You can 
drive a truck through innovation. We don't 
know what that means.” 

Here, Nature looks at key science issues 
that Trump will confront during his first few 
months in office. 


BIOMEDICAL SCIENCE 

Trump has said that one of his priorities after 
taking office will be to reverse several executive 
orders that President Barack Obama, a Dem- 
ocrat, issued on topics ranging from climate 
change to immigration. Some biomedical 
researchers are worried that Trump will cancel 
an Obama order that authorized experiments 
with human embryonic stem cells. 

“Tt’s something very tangible,’ says Wilson. 
“He could do away with it on day one.” 

There's precedent for such actions: Obama's 
March 2009 executive order on stem cells 
reversed limits put in place by his predecessor, 
Republican George W. Bush. 

And Pence opposed Obama's decision to 
authorize research with human embryonic 
stem cells. “Tt is morally wrong to create human 
life to destroy it for research, he wrote in a 
March 2009 newspaper commentary. 

Overall, however, Trump has said little about 
biomedical science — aside from an oft-quoted 
2015 radio interview in which he called the 
NIH “terrible”. 

Mary Woolley, president of the advocacy 
group Research! America in Arlington, Vir- 
ginia, worries that biomedical science will not 
bea priority for Trump. “A lot of it is not really 
controversial,” she says. “We tend in this coun- 
try to take progress for granted.” 


CLIMATE CHANGE 

If Trump keeps his promises, the United 
States will reverse course on global warm- 
ing. The president-elect has blasted the US 
Environmental Protection Agency (EPA) 
and said that he will repeal Obama’s climate 
regulations. And Myron Ebell, a prominent 
climate sceptic who directs energy and global- 
warming policy at the Competitive Enterprise 
Institute in Washington DC, is leading Trumps 
transition team for the EPA. 

“I take Trump at his word,” says Jeffrey 
Holmstead, an attorney at the firm Bracewell 
in Washington DC who worked at the EPA 
under George W. Bush. “And I think they won't 
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SCIENTISTS SPEAK 


How researchers reacted to the election of Donald Trump. 


“I don’t actually know a Trump 
supporter who I could talk to 
about the election. How can 
Treach the public if I’m only 
speaking to my own circle?” 
Peter Peregrine, anthropologist at 
Lawrence University in Wisconsin and 
the Santa Fe Institute in New Mexico. 


“I feel sad for the United States 
and its scientists, and would 
like to welcome good scientists 
to work and live in China.” 

Yi Rao, neuroscientist at Peking University. 


have any difficulties.” 

Trump's first target will probably be the 
Clean Power Plan, Obama’s regulations to 
reduce greenhouse-gas emissions from power 
plants, which roughly two dozen states are 
challenging in court. The case is expected to 
reach the Supreme Court as early as next year. 
By then, Trump may have filled the court's cur- 
rent vacancy with a conservative justice. That 
would put the climate regulations in jeopardy. 

But it would be easy enough for the Trump 
administration to just revoke the Clean Power 
Plan on its own, Holmstead says. 

Trump’s vow to pull the United States out 
of the Paris climate accord — a process that 
could take four years to complete — is still 
sinking in at the United Nations climate talks 
in Marrakesh, Morocco. Delegates there are 
busy hashing outa plan to implement the Paris 
agreement. “Were sort of in the denial stage,” 
says Jake Schmidt, international programme 
director at the Natural Resources Defense 
Council in New York City. “T suspect there will 
be some disappointment and anger starting to 
bubble up in the next couple of days.” 

Already, many nations are looking to China 
for leadership on climate. The country leads 
the world in renewable-energy investment 
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“Political events do not and 
cannot change the reality of 
climate change.” 

Philip Duffy, president of the Woods 
Hole Research Center in Falmouth, 
Massachusetts. 


“As a Canadian working at a 
US university, a move back to 
Canada will be something I’Il 
be looking into.” 

Murray Rudd, studies environmental 
economics and policy at Emory 
University in Atlanta, Georgia. 


For more reactions, see go.nature.com/2fu4c) 55 


because it views clean energy as a neces- 
sity and an opportunity, says Andrew Steer, 
president of the World Resources Institute, an 
environmental think tank in Washington DC. 
He hopes that Trump will come to see tough 
climate-change policies as a tool to ensure that 
the United States remains competitive in the 
development of energy technology. 


SPACE 

Trump himself has said little about space 
policy, but astronaut Eileen Collins — the first 
woman to command a space shuttle — spoke 
at the Republican national convention in July. 
Collins called for the United States to reassert 
its leadership in space exploration. 

In October, two of Trump’s campaign advis- 
ers wrote a pair of commentaries in SpaceNews 
laying out possible directions for space policy 
under a new president. The articles argued 
that NASA should focus more on deep-space 
exploration and less on what they called “polit- 
ically correct environmental monitoring”. 

NASA’ Earth-observing missions account 
for more than one-third of the agency’s 
science budget, an expense that has come 
under fire from congressional Republicans. 
“It’s conceivable that the Trump White House 
could go after NASA Earth science,” says John 
Logsdon, former director of the Space Policy 
Institute at the George Washington University 
in Washington DC. 

The Trump advisers also argued for more 
public-private partnerships in civilian space. 
Such efforts are already under way with private 
companies now ferrying US cargo, and soon US 
astronauts, to the International Space Station. 

Casey Dreier, director of space policy for 
the Planetary Society in Pasadena, California, 
says that space is likely to be a low priority for 
Trump during his first 100 days as president. 


SPACEX 


Falcon9 rockets built and launched by SpaceX 
carry cargo to the International Space Station. 


Dreier will be watching whether the new Con- 
gress cuts government spending. “If that’s the 
case, NASA will be impacted by that along with 
every federal agency,’ he says. 


IMMIGRATION 

Trump reinvigorated the national debate on 
immigration with his campaign pledges to 
build a wall along the US border with Mexico 
and to temporarily ban Muslims from entering 
the United States. 

“Our hope is that the rhetoric of the elec- 
tion was only a facade for something hopeful 
that’s going to be more pragmatic and engaging 
communities,” says Carl Saab, a neuroscientist 
at Brown University in Providence, Rhode 
Island, and the former president of the Society 
for Arab Neuroscientists. 

Trump has variously said that the ban would 
apply to all Muslims and to anyone from 
“nations tied to Islamic terror’, drawing vigor- 
ous criticism from civil-liberties groups that 
say such a policy would violate the US Consti- 
tution. He has also proposed deporting more 
people who are in the United States illegally, 
which could include those who came to the 
country as children. 

Some researchers worry that such policies 
would threaten US research dominance. About 
5% of US university students come from other 
countries, including more than 380,000 people 
studying science, engineering, technology or 
mathematics. 

“The rhetoric that Mr Trump ran under has 
frightened lots of immigrants,’ says Benjamin 
Corb, director of public affairs for the American 
Society for Biochemistry and Molecular Biology 
in Rockville, Maryland. “I certainly hope that 
we don't end up losing some brilliant minds as 
a result of some near-sighted policies.” = 
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Why the polls 
missed Trump 


Pollsters lament failure to foresee outcome of US election. 


BY RAMIN SKIBBA 


hat went wrong? That's the question 
many political pollsters in the 
United States are asking them- 


selves in the aftermath of the 8 November 
presidential election. Republican candidate 
Donald Trump won in an electoral landslide, 
but for months most polls forecast a victory 
for his Democratic opponent, Hillary Clinton. 

Many types of poll, including randomized 
telephone polls and online polls that people 
opt into, indicated a tightening of the gap 
between the two candidates in the weeks lead- 
ing up to the election — but still pointed toa 
Clinton win. “The industry is definitely going 
to be spending a lot of time doing some soul- 
searching about what happened,” says Chris 
Jackson, head of US public polling at Ipsos, 
a global market-research and polling firm 
based in Paris. 

The most recent national polls — including 
those conducted by ABC News/ Washington 
Post, Ipsos, YouGov and Fox News — all esti- 
mated a Clinton lead 


of 3-4% over Trump. “It seems like 
Yet as the last votes Trump voters 
are being counted, Wweremore 
Clinton leads the pop- enthusiastic 
ular vote by a razor- about turnout 
thin margin: just0.2%. and less 

The majority of states enthusiastic 
have tipped for Trump, about 

giving him their valu- responding 
able electoral-college fg polls.” 


votes and ensuring his 
victory. These include several Midwestern 
states that Clinton was expected to win. 

Poll aggregators such as FiveThirtyEight 
nonetheless forecast Clinton’s chances of 
victory at 71% or higher. This dramatic poll- 
ing failure could have been due to poorly 
assessed voters, people misreporting their 
voting intentions, or pollsters inadequately 
surveying some segments of the population. 

“Tt’s a big surprise that such a wide vari- 
ety of polls using such a wide variety of 
methodologies have all the errors fall in the 
same direction,” says Claudia Deane, vice- 
president of research at the Pew Research 
Center in Washington DC. 

The University of Southern California 
Dornsife/Los Angeles Times presidential 


election poll, which included an online panel 
of nearly 3,000 people, was the only major 
national poll to forecast a Trump lead days 
before the election. “But we're not sure we were 
right either,’ says Jill Darling, survey director 
at the university’s Center for Economic and 
Social Research in Los Angeles. She notes that 
Trump did not defeat Clinton by 3%, as her 
groups most recent poll predicted. 

With each election, pollsters have a harder 
time reaching people. Now that Americans 
have fewer landlines and more mobile phones 
with caller ID, they don't respond to calls from 
unfamiliar numbers. Online surveys also 
struggle to recruit participants. A poll gener- 
ally needs at least 1,000 participants who are 
representative of the general population with 
respect to gender, race, education, income 
level and geographic distribution to produce 
statistically significant results. 

Pollsters strive to assess not just who 
supports whom, but also who will be likely to 
vote. This year, 119 million people cast bal- 
lots, accounting for 55.6% of registered voters, 
according to Michael McDonald, a political 
scientist at the University of Florida in Gaines- 
ville. That is the lowest percentage since 2000. 

There were also more undecided voters this 
year than in previous presidential elections. 
Such voters may be under-represented in 
polls, yet tilt towards one candidate, Darling 
says. Only 53% of poll respondents disclosed 
who they would vote for, lower than the 70% 
in earlier elections, she adds. And people 
overestimate their own likelihood of voting. 

“It seems like Trump voters were more 
enthusiastic about turnout and less enthu- 
siastic about responding to polls. That’s a 
deadly combination,’ says Andrew Gelman, a 
statistician and political scientist at Columbia 
University in New York City. 

Polling experts in Britain conducted a 
formal inquiry following polling failures in 
last year’s general election, when polls under- 
estimated the turnout of older, Conservative 
voters. Now, in the United States, the Ameri- 
can Association for Public Opinion Research 
has already named an ad-hoc committee to 
dig into the data and conduct a post-mortem 
on the election polls. They aim to produce 
findings by next May, Deane says. m 
See go.nature.com/2f9hpeo for a longer 
version of this story. 
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Davide Vannoni during court proceedings against his Stamina Foundation in 2014. 


REGENERATIVE MEDICINE 


Stem-cell scandal 
gets fresh scrutiny 


A treatment banned in Italy may have resurfaced abroad. 


BY ALISON ABBOTT 


ublic prosecutors in Turin, Italy, are 
Pp investigating whether disgraced 

stem-cell entrepreneur Davide Van- 
noni— convicted on criminal charges last year 
for administering unproven stem-cell therapies 
in Italy — is offering his treatments again, this 
time in eastern Europe. 

In March 2015, Vannoni was convicted on 
charges of conspiracy and fraud related to his 
treatments, which had been declared danger- 
ous by the Italian Health Authority (ATFA). His 
case was a cause célébre among Italian scientists, 
who fought for many years to stop him adminis- 
tering stem cells to patients through his Stamina 
Foundation (see Nature 518, 455; 2015). 

Vannoni was sentenced to 22 months in 
prison, but the sentence was suspended in a 
plea bargain drawn up by prosecutor Raffaele 
Guariniello, who said that the terms required 
Vannoni to refrain from organizing further 
therapies — either in Italy or abroad. Soon 
after the plea bargain, patient groups on social 
media posted comments that the stem-cell 
treatment was available once more, in Geor- 
gia. And in late October, a patient came for- 
ward with a detailed account of his treatment. 
Prosecutors told Nature that this has prompted 
them to take action. 

Vannoni did not respond to requests for 
comment, forwarded through his lawyer, 
Liborio Cataliotta. But Cataliotta told Nature 


that, although he did not know whether Van- 
noni was continuing therapies in Georgia, he 
thought there were “technical defects” in what 
Guariniello had said about the terms of the plea 
bargain. The prosecutor had no authority to rule 
on what might be carried out in countries where 
such stem-cell therapies are allowed, he said. 

Guariniello has since retired, but another 
prosecutor in Turin, Vincenzo Pacileo, told 
Nature that the deferred sentence hinges on 
the premise that Vannoni “gives up” his stem- 
cell activities, a phrase that Pacileo says implies 
both in Italy and abroad. 


However, a senior Ital- “Itis really 

ian legal expert whohas disappointing 
read the 46-page sen- _ that science 
tence, but doesnot wish has not been 
tobe named, saysthatthe able to put an 


premise is legallyambig- end tothis.” 
uous. Nowhere does the 

document specify that this applies to activities 
abroad, he says. 

“Tt is really disappointing that science has not 
been able to put an end to this,” says Luca Pani, 
a former director-general of AIFA. “A patient is 
a patient wherever they are treated, and I worry 
for them and their families.” Pani tried to close 
down Stamina’s operations in Brescia in 2012, 
but desperate patients got court orders allow- 
ing them to be treated. It took two years before 
magistrates in Turin ordered the confiscation of 
equipment and materials from the Stamina lab. 

In July this year, the Italian news service 
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ANSA reported that patients had claimed to 
have received the Stamina therapy in Georgia. 
It did not identify them or detail their clinical 
experience. But on 24 October, 52-year-old 
Andrea Zicchieri appeared in a tell-all report 
on the Italian television news channel La7, com- 
plete with video documentation. Zicchieri, who 
has a motor-neuron disease, said he had paid 
€18,000 (US$20,000) for a cycle of three treat- 
ments in Tbilisi between July and September 
that had been arranged with Vannoni. Zicch- 
ieri told La7 that his brother had gone to Tbilisi 
in May to donate bone-marrow cells that were 
used in the treatment, and that Vannoni was 
present while the treatment was carried out. He 
said that the treatment had not worked. 

Zicchieri, who has now been called to give 
evidence to the police, repeated the same 
details to Nature, providing screenshots of 
messages between himself and Vannoni, and 
adding that he had sought out Vannoni know- 
ing that the treatment was controversial. “Even 
if there was only 1% chance of hope, I wanted 
to take that chance,” he said. 

His account is consistent with talk on the 
social-media accounts of patient groups that 
support Vannoni’s work. According to photo- 
graphs supplied by Zicchieri, the clinic was the 
Mardaleishvili Medical Center in Tbilisi, which 
also features on a patient-group site dedicated 
to the Stamina treatment. When Nature rang 
the centre for comment, a man said that Van- 
noni had been there and would return in Janu- 
ary, and that all discussions about stem-cell 
therapy should go through Vannoni and not 
the clinic. He didn't answer on further attempts 
to make contact. 

Postings on two of Vannoni’s Facebook 
pages — previously reported in Italian 
media — suggest that he is now working abroad. 
In February, in response to a question about 
his whereabouts, Vannoni posted that he was 
“abroad to continue Stamina” On another page, 
he posted a message in June thanking people for 
their birthday wishes, and noting that “some- 
times one must abandon a battle to win a war. In 
short — to change one’s country. #750.” Under- 
neath the post, commenters said that the #750 
means that Vannoni is now claiming to have 
made 750 stem-cell infusions — many more 
than he had made in Italy. (An earlier message, 
posted on 24 December 2015, ends “#450”). 

Vannoni’s therapy involves modifying stem 
cells taken from the bone marrow ofa patient or 
relative, and injecting them into the patient. In 
Italy, Vannoni has used this approach in people 
with a range of conditions, from Parkinson's dis- 
ease to muscular dystrophy. Recent cases of sim- 
ilar treatments have led to cancers, notes Elena 
Cattaneo, a neuroscientist at Italy's University 
of Milan who was among those who worked to 
stop Vannoni and who is now an Italian senator. 
That Vannoni has apparently started again is “a 
disgrace’, she says. “Governments and health 
institutes should do more to inform patients 
about these sorts of therapies.” m 
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| CAN HAZ MORE SCIENCE EMOJI? 
Host of nerd icons proposed. 


BY KATHERINE BOURZAC that oversees the list, text messages could soon feature DNA, a mole or 
a conical flask. Existing official science emoji include a telescope and a 

Texting is set to reach new heights of scientific expression. A group of magnifying glass. Group members say that they want scientists to see 

science enthusiasts and designers is proposing to add new science their interests reflected in the official emoji offerings. m 

emoji to the official list of the icons. If approved by the organization See go.nature.com/2g6ei5e for a longer version of this story. 
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| NEWS IN FOCUS 


Rwanda has improved the quality of its home-grown science. 


DEVELOPMENT 


Donors slash aid 
to Rwanda 


Progress on public health and science prompts a shift in 
funding to more-troubled nations. 


BY ERIKA CHECK HAYDEN 


wanda has made major public-health 
Rites since the country’s genocide 

against the Tutsi people ended in June 
1994, but declines in foreign aid now threaten 
that progress. 

Donors such as the US President’s Emer- 
gency Plan for AIDS Relief and the Global 
Fund to Fight AIDS, Tuberculosis and Malaria 
have reduced assistance to Rwanda by 40% over 
the past three years. The situation will be a hot 
topic at the annual meeting of The World Acad- 
emy of Sciences in Kigali on 14-17 November. 

“If the decline in funding continues, there 
are a lot of things to lose rather than to gain,” 
says Sabin Nsanzimana, who manages initia- 
tives on HIV and other blood-borne diseases 
at the Rwanda Biomedical Center in Kigali, 
which runs the country’s health programmes. 

The declining foreign aid is part of two 
broader trends in development: redirect- 
ing money to countries that have the highest 
number of sick people, and urging developing 
countries to fund more of their own develop- 
ment work. The former has reduced aid to 
Rwanda, a small country that has slashed the 


incidence of diseases such as HIV. 

Like many other developing nations, Rwanda 
doesn’t have the resources to move money 
from other priority areas such as education 
into health to fill the aid gap, says Nsanzimana. 
Even if sub-Saharan African nations were to 
more than triple their spending on HIV in the 
next five years, he notes, a recent study found 
that most could only raise half the money they 
need to end the epidemic’s global threat by 2030 
— a goal set by the Joint United Nations Pro- 
gramme on HIV/AIDS (M. Remme et al. Soc. 
Sci. Med. 169, 66-76; 2016). 

Fredrick Kateera, director of research for 
the Rwanda office of the non-profit organiza- 
tion Partners in Health, says that funding cuts 
could imperil the research that is needed to 
fight diseases such as malaria in Rwanda and 
elsewhere. “Setting up surveillance systems 
costs just as much money as just giving out 
drugs and bed nets,” he says. 

Rwanda has long been seen as a prime exam- 
ple of how science can aid development. After 
the genocide in 1994, President Paul Kagame 
invested in building roads, developing high- 
speed Internet access and applying science to 
local problems. The country slashed maternal 


342 | NATURE | VOL 539 | 17 NOVEMBER 2016 
© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


and infant deaths, new HIV infections, AIDS 
deaths and mother-to-child HIV transmission 
(S. Nsanzimana et al. BMC Med. 13, 216; 2015). 

Kagame has also used his authority to ensure 
that science conducted by local and foreign 
researchers promotes domestic development. 
In 2012, the country’s ministry of health pub- 
lished guidelines compelling all foreign sci- 
ence projects to strengthen Rwandan research 
capacity, for example by training its scientists 
or building infrastructure. Rwandan research- 
ers routinely appear as first and last authors on 
studies conducted in the country, in contrast to 
other African nations where local researchers 
often don't benefit from foreign collaborations. 

“You can't just be a global health researcher 
who drops in, gets some data, publishes it with 
your name as first author and never comes 
back,” says epidemiologist Edward Mills of the 
Institute for Health Metrics and Evaluation in 
Seattle, Washington, who is an adjunct profes- 
sor at the University of Rwanda in Kigali. 


HARD LINE ON HEALTH 

Staff at clinics and ministries who fail in their 
roles to meet stringent health targets can also 
be reassigned or sacked. In July, for instance, 
Kagame removed Agnés Binagwaho from her 
post as minister of health. The highly regarded 
paediatrician earned the US$100,000 Roux 
Prize in April 2015 for using data to improve 
public health, but was let go after malaria 
cases in the country quadrupled to 2 million 
between 2012 and 2015. 

Human-rights groups have chafed at 
Kagame’s authoritarian tendencies, but he has 
kept corruption low in Rwanda compared with 
other sub-Saharan African nations. This made 
the country a favourite of donors for much of 
the 2000s. But that shifted after 2013, when 
organizations such as the US Institute of Med- 
icine questioned donors’ generosity towards 
Rwanda over nations with much higher HIV 
burdens. 

In response, donors recalibrated. In 2014, 
the Global Fund to Fight AIDS, Tuberculo- 
sis and Malaria began using a new formula 
that allocated funding in part according toa 
country’s burden of disease. In June, the fund 
said that it had reworked its formula again in 
response to protests from countries such as 
Rwanda that argued they were being punished 
for their success. 

Despite the funding constraints and bureau- 
cratic restrictions, Rwandan researchers who 
train abroad often return home. Kateera 
earned his medical degree in Uganda, where 
universities have partnerships with prestigious 
institutions in Europe, the United States and 
Asia. With one-third of the population and 
one-ninth the area of Uganda, Rwanda doesn't 
offer the same opportunities. But Kateera feels 
that he can make a difference in Rwanda: “You 
can make a big impact and measure it much 
more easily here compared to ina larger coun- 
try,’ he says. m 
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SCIENCE PUBLISHING 


IN FOCUS | NEWS 


Reviews open up 


Trials suggest growing support for transparent peer review. 


BY EWEN CALLAWAY 


hen Kevin Sinclair reported that 
13 cloned sheep his lab had stud- 
ied lived long and healthy lives, he 


wanted to be as transparent as possible about 
what has been a controversial research area. 
Sinclair had invited journalists to see the 
sheep while the experiment was in progress at 
the University of Nottingham’s School of Bio- 
sciences in Loughborough, UK. And when his 
paper was published this July, he agreed that 
its peer-review reports should also be made 
public (K. D. Sinclair et al. Nature Commun. 
7, 12359; 2016). 

The developmental biologist was taking part 
ina trial by Nature Communications, in which 
the journal offered authors the option to have 
their reviews published. The goal was to find 
out whether scientists would see the practice 
as a way to make research more reliable and 
egalitarian — or asa needless nod to transpar- 
ency that could harm peer review. 

A few journals, such as Peer], the BMJ and 
F1000Research, already embrace open peer 
review in different forms. Some forbid it. Other 
publishers and journals, including Nature 
Communications, are treating the practice as 
a frigid swimming pool: they are dipping their 
toes in the water, but are reluctant to plunge in. 

So far, scientists seem willing to give open 
peer review a try. On 10 November, Nature 
Communications announced that around 
60% of its authors in 


2016 had agreed to “It’s about 
have their reviews making the 
published, and that process fairer 
it would continueto andmore 


offer scientists the 
option — although 
would not make it mandatory. (Reviewers can 
choose to withhold their names, but cannot 
otherwise influence the process, apart from 
declining to take part in an ‘open review paper.) 

Meanwhile, an unpublished online survey 
funded by the European Commission (EC) has 
found that more than half of its 3,062 respond- 
ents thought that open peer review should 
become routine, although they expressed some 
qualms about specifics. One challenge is that 
open peer review means different things to 
different people, says Anthony Ross-Hellauer, 
an information scientist at the Géttingen State 
and University Library in Germany, who ran 
the survey for the EC-funded ‘OpenAIRE’ 
project on open science. Some think that it 
implies only naming the reviewers, but not 


transparent.” 


making their reports available, whereas others 
think that unsigned reports should be public. 
“Tt makes it really difficult to talk about what 
works in what circumstances if we're not using 
the same language,’ Ross-Hellauer says. 

Some scientific communities seem to 
embrace open-review reports more read- 
ily than others, notes Joerg Heber, executive 
editor of Nature Communications. During 
his journal’s trial, authors on more than 70% 
of eligible papers in ecology and evolution, 
molecular biology and Earth sciences adopted 
open reports, whereas physics papers saw the 
lowest uptake. 


OPEN EXPERIMENTS SPREAD 

For nearly two years, Dutch publisher Elsevier 
has published unsigned peer-review reports for 
five of its titles. On the basis of that trial, Else- 
vier plans to bring open review to other journals 
next year. And some journals allow reviewers 
to post their pre-publication reviews at other 
websites. One of those sites is Publons.com, 
which encourages scientists to make their peer 
reviews public, ifjournals permit. 

Advocates say that the benefits of open peer 
review are straightforward. “It's about mak- 
ing the process fairer and more transparent, 
so people can be held accountable if some- 
thing goes wrong,’ says Jonathan Tennant, a 
palaeontologist at Imperial College London 
and communications director at ScienceOpen, 
an open-research publishing network. But 
Stephen Heard, an ecologist at the University 
of New Brunswick in Fredericton, Canada, has 
misgivings. He worries about scientists posting 
peer-review reports without the knowledge or 
permission of reviewers. Heard also says that 
if he knew his reviews would end up public, it 
would make the job harder because he would 
feel obliged to cut back on technical language. 
“I would inevitably do fewer reviews,” he says. 

It’s also not clear who actually reads the 
reports once they are made public, Heard 
says. Heber says that download figures will 
be assessed by Nature Communications. (The 
journal is published by Nature’s publisher, 
Springer Nature; Nature’s news and comment 
team is editorially independent of the pub- 
lisher’s research editorial teams.) 

Sinclair has not yet heard from anyone who 
has read the review report for his paper. And 
although his lone experience was encour- 
aging, he is not ready to be an evangelist for 
open peer review. “I think most people, and 
I would include myself, are apprehensive,” 
Sinclair says. m 
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eserved. 


The waters of the Southern Ocean have delayed 
global warming by absorbing much of the excess 
heat and carbon generated by humanity. 


But that might not last. 


BY JEFF TOLLEFSON 


an expedition south of New Zealand. “It felt like the ship would be crushed each time we rolled 
into a mountain of water,’ recalls Russell, an ocean modeller at the University of Arizona in 
Tucson. At one point, she was nearly carried overboard by a rogue wave. 

But what really startled her was the stream of data from sensors analysing the seawater. As the 
ship pitched and groaned, she realized that the ocean surface was low in oxygen, high in carbon 
and extremely acidic — surprising signs that nutrient-rich water typically found in the deep sea 
had reached the surface. As it turned out, Russell was riding waves of ancient water that had not 
been exposed to the atmosphere for centuries. 

Although controversial when she encountered it back in 1994, this powerful upwelling is now 
recognized as a hallmark of the Southern Ocean, a mysterious beast that swirls around Antarctica, 
driven by the world’s strongest sustained winds. The Southern Ocean absorbs copious amounts 


J oellen Russell wasnt prepared for the 10-metre waves that pounded her research vessel during 
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Strong winds help to 
pull up ancient water 
from the ocean bottom. 


COLIN MONTEATH/MINDEN/NGC 


of carbon dioxide and heat from the atmosphere, which has slowed the 
rate of global warming. And its powerful currents drive much of the 
global ocean circulation. 

The hostile conditions have kept oceanographers at bay for decades, 
but a new era of science is now under way. Researchers from around the 
world are converging on the region with floats, moorings, ships, glid- 
ers, satellites, computer models and even seals fitted with sensors. The 
goal is to plug enormous data gaps and bolster understanding of how the 
Southern Ocean — and the global climate — 
functions. Doing so could be key to improv- 
ing predictions of how quickly the world will 
warm, how long the Antarctic ice sheet will 
survive and how fast sea levels will rise. 

“Tt’s been amazing to see this explosion 
of information,’ says Arnold Gordon, an 
oceanographer at Lamont-Doherty Earth 
Observatory in Palisades, New York, who 
led some of the early Southern Ocean sur- 
veys in the 1960s. “New technologies are 
allowing us access to these remote areas, 
and we are far less dependent on driving a 
ship through the sea ice” 

Already, initial data from an array of 
ocean floats suggest that upwelling waters 
could be limiting how much CO, the South- 
ern Ocean absorbs each year. This raises 
new questions about how effective these 
waters will be as a brake on global warming 
in decades to come. 

“The Southern Ocean is doing usa big cli- 
mate favour at the moment, but it’s not necessarily the case that it will con- 
tinue doing so in the future,’ says Michael Meredith, an oceanographer 
with the British Antarctic Survey in Cambridge, UK. Meredith is heading 
aseries of expeditions over the next five years to help document the uptake 
of heat and carbon. “It really is the key place for studying these things.” 


TRACKING CARBON 

The mysteries of the Southern Ocean have beckoned explorers for 
centuries, but the unique geography of the region makes it a perilous 
place for ships. There are no landmasses to tame the winds and waves 
that race around the planet at 60°S. And the ice surrounding Antarctica 
is notorious for engulfing wayward vessels, including Ernest Shackleton’s 
Endurance in 1915. 

Scientists only started to realize how important the region is for 
controlling global climate in the 1980s, when several groups were trying 
to explain what had caused atmospheric CO, concentrations to drop by 
about one-third during the last ice age and then later rise. Oceanogra- 
pher Jorge Sarmiento at Princeton University in New Jersey realized that 
changes in circulation and biology in the Southern Ocean could help to 
cool and warm the planet'. 

Three decades later, Sarmiento is leading an effort to gather the first 
real-time data on the chemical and biological processes that govern 
carbon in the Southern Ocean. The US$21-million Southern Ocean 
Carbon and Climate Observations and Modeling Project (SOCCOM) 
has already deployed 51 ofa planned 200 robotic floats that bob up and 
down in the upper 2,000 metres of the Southern Ocean. Building on the 
global Argo array, which consists of more than 3,700 floats collecting 
temperature and salinity data, the SOCCOM floats also measure oxygen, 
carbon and nutrients. 

With the new data, Sarmiento and his team can test their models 
and refine estimates of how CO, moves between the seas and the sky. 
Indirect evidence suggests that the Southern Ocean isa net carbon sink 
and has absorbed as much as 15% of the carbon emissions emitted by 
humanity since the industrial revolution. But at some times of year and 
in specific places in this region, carbon-rich surface waters release CO, 
into the atmosphere. 


“THE SOUTHERN 
OCEAN IS DOING US 
A BIG FAVOUR AT THE 
MOMENT, BUT IT'S 
NOT NECESSARILY 
THE CASE THAT IT 
WILL CONTINUE.” 


FEATURE | NEWS 


Now, researchers are getting some of their first glimpses in near-real 
time of what happens in the Southern Ocean, particularly in winter. 
“Right off the bat, we are seeing CO, fluxes into the atmosphere that are 
much greater than we had estimated before,’ Sarmiento says. “It’s just 
revolutionary.’ 

The unpublished analysis is based on just 13 floats that have been in 
the water for at least a year, so the question now is whether the higher 
CO, emissions during winter represent larger trends across the entire 
Southern Ocean. 

“It's pretty tantalizing,” says Alison Gray, 
a postdoctoral researcher at Princeton who 
is leading the study. “It would imply that 
potentially there is a much weaker carbon 
sink in the Southern Ocean than has been 
estimated.” 

Hints of something similar have been 
seen before. In 2007, a team led by Corinne 
Le Quéré, now director of the Tyndall Cen- 
tre for Climate Change Research in Nor- 
wich, UK, published a study in Science’ 
indicating that the rate of carbon uptake 
by the Southern Ocean decreased between 
1981 and 2004. The authors blamed the 
changes on the winds that encircle the Ant- 
arctic continent. The speed of those winds 
had increased during that time, probably as 
a result of the hole in the stratospheric ozone 
layer over Antarctica and possibly because 
of global warming. Stronger winds are better 
able to pull up deep, ancient water, which 
releases CO, when it reaches the surface. That would have caused a net 
weakening of the carbon sink. 

If that trend were to continue, atmospheric CO, levels would rise even 
faster in the future. However, a study in Science’ last year found that the 
carbon sink started to strengthen in the early 2000s (see “The unreliable 
sink). 

Le Quéré says it’s unclear whether that rise in CO, absorption is a 
return to normal or a deviation from the long-term weakening of the 
sink. Regardless, she says, it’s now clear that the Southern Ocean might 
be much more fickle than scientists thought. 

SOCCOM floats will probably help researchers to answer these ques- 
tions, but it could be years before they can say anything concrete about 
trends. Nor is Le Quéré convinced that the new network of floats will 
provide enough detail. In a paper published in July*, she found that mod- 
els of carbon uptake by the Southern Ocean depend strongly on assump- 
tions about the structure of the food web there. She says that climate 
scientists need to improve their understanding of the type and timing 
of phytoplankton and zooplankton blooms if they are going to get their 
climate projections right. “In my view, that’s the next frontier,’ she says. 


WARMING WATERS 

Carbon is only part of the story in the Southern Ocean. Scientists are also 
beginning to pin down what happens to all the heat that gets absorbed 
there. 

The Southern Ocean is the starting point for a network of currents that 
carry water, heat and nutrients throughout the ocean basins. Near Ant- 
arctica, surface waters normally grow cold and dense enough to sink to 
the bottom of the ocean, forming abyssal currents that hug the sea floor 
as they flow north into the Pacific, Atlantic and Indian oceans. 

Much of what scientists know about these currents comes from ship 
surveys conducted every decade or so since the early 1990s. In 2010, 
when researchers analysed data from the surveys, they found a pro- 
nounced warming trend in abyssal waters, which were somehow absorb- 
ing about 10% of the excess heat arising from global warming”. 

The level of warming in the deep ocean came as a surprise, and 
researchers have proposed several explanations that centre on the 
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THE UNRELIABLE SINK 


The amount of carbon dioxide absorbed by the Southern Ocean has fluctuated 
markedly. In the 1990s, the region lost some of its capacity to take up the 
greenhouse gas. It started to suck in more in 2002, but this trend could reverse. 
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Southern Ocean. One factor could be that surface waters around 
Antarctica have become less salty, in part because of an increase in 
summer rainfall over the ocean. Fresher surface water is less dense, 
so that change would choke the supply of cold water sinking to the sea 
floor to feed the bottom currents. “The deep water warms up because 
it’s not getting as much cold-water replenishment,’ says Gregory John- 
son, an oceanographer with the National Oceanic and Atmospheric 
Administration (NOAA) in Seattle, Washington, who co-authored 
the 2010 analysis. 

An as-yet-unpublished analysis, based on initial data from the third 
round of ship surveys, finds similar trends, but researchers have longed 
for more frequent measurements to provide a fuller picture. That could 
happen ifa proposed international project moves forward. Called Deep 
Argo, this would be an array of floats that regularly dive all the way to 
the bottom of the ocean. Johnson is involved in a US consortium that 
is testing 13 floats in a basin off the coast of New Zealand, and another 
nine south of Australia. 

Others are using moorings to monitor deep water flows. Since 1999, 
Gordon has maintained an array of moorings in the Weddell Sea, one 
of the main areas where cold surface waters sink to form ocean bottom 
currents. He has seen the deep water growing less salty in some areas, 
but the long-term trends are not clear’. 

“We are really only scratching the surface of how bottom waters are 
changing, and how that is impacting the large-scale global ocean cir- 
culation,’ he says. 


ALONG THE EDGE 
In January 2015, oceanographers aboard the Australian icebreaker 
Aurora Australis were cruising off the coast of Antarctica when they 
were presented with a unique opportunity. Following a crack in the 
sea ice, they were able to reach the edge of the Totten Glacier, one of 
the biggest drainage points for the East Antarctica ice sheet. No other 
expedition had reached within 50 kilometres of the glacier. 

The team deployed floats and gliders into the waters around and 
underneath the glacier, which is 200 metres thick at its front edge. What 
they found came asa shock. The water at the front of the glacier was 
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3 °C warmer than the freezing point at the base of the glacier. 

“We always thought Totten was too far away from warm water to be 
susceptible, but we found warm water all over the shelf there,’ says Steve 
Rintoul, an oceanographer at the Antarctic Climate and Ecosystems 
Cooperative Research Centre in Hobart, Australia. 

Scientists had already shown’®* that warm-water currents are under- 
cutting the West Antarctic ice sheet in many areas along the peninsula 
where the glaciers extend into the ocean. But Rintoul says that this 
expedition provided some of the first hard evidence that these same 
processes are affecting East Antarctica, raising new questions about 
the longevity of the mammoth ice sheets that blanket the continent. 

There is no clear answer yet for what is driving the warming of these 
near-surface currents. Some explanations invoke changes in the winds 
over the Southern Ocean and the upwelling of warm waters. Others 
focus on fresher surface waters and an expansion of sea ice in some 
areas. The combination of extra sea ice and fresher surface waters could 
create a kind of cap on the ocean that funnels some of the warmer 
upwelling water towards the coast. 

“Every scientist, including me, has their favourite explanation,” 
Gordon says. “But that’s how science works: the more you observe, the 
more complicated it gets.” 

Finding the answers may require recruiting some of Antarctica’s 
permanent residents. Meredith's team at the British Antarctic Survey 
plans to equip Weddell seals with sensors so that the animals can collect 
water measurements as they forage below the sea ice along the conti- 
nental shelf. This zone has particular importance because it is precisely 
where cold water begins its descent into the abyss. 

“The processes that happen in that shelf region are very important 
ona global scale, but measuring them is very difficult,” Meredith says. 
“The seals sort of transcend that barrier.” 

The Weddell seals are just one component of the expedition’s arsenal. 
The team will also send autonomous gliders under the sea ice on pre- 
programmed routes to collect temperature and salinity data down to 
depths of 1,000 metres. Measurements taken from ships will help fill 
in the picture of what happens in this crucial region around Antarc- 
tica — and how it relates to the rest of the global ocean circulation. 

Getting the data is only half the challenge. Ultimately, scientists need 
to improve their models of how currents transport heat, CO, and nutri- 
ents around the globe. Even armed with better measurements, results 
suggest that modellers have a way to go. 

An analysis of data from the ship surveys suggests that upwelling 
ocean water does not rise in a simple pattern near Antarctica. Rather, 
it swirls around the continent one and a half times before reaching 
the surface. And Sarmiento’ team at Princeton found that only the 
highest-resolution models could accurately capture that behaviour. 
Sarmiento says that it could be a while before the models can simulate 
what really happens in this region, but he is confident that day will 
eventually arrive. 

For Russell, it’s as if scientists are at last lifting the veil on the 
Southern Ocean. After she returned from her maiden voyage in 1994, 
she turned to modelling because there wasn’t enough data at the time 
to quantify the effects of the upwelling she encountered. Today she 
has it both ways. Russell is heading the modelling component of the 
SOCCOM project, and she is getting more data than she ever dreamt of. 

“It’s just a wonderful time to be an oceanographer,’ she says, “even as we 
are carrying out this really scary geophysical experiment on our planet” m 


Jeff Tollefson writes for Nature from New York. 


. Sarmiento, J. L. & Toggweiler, J. R. Nature 308, 621-624 (1984). 

. Le Quéré, C. et al. Science 316, 1735-1738 (2007). 

. Landschititzer, L. et al. Science 349, 1221-1224 (2015). 

. Le Quéré, C. et al. Biogeosciences 13, 4111-4133 (2016). 

. Purkey, S. G. & Johnson, G. C. J. Clim. 23, 6336-6351 (2010). 

Gordon, A. L., Huber, B., McKee, D. & Visbeck, M. Nature Geosci. 3, 551-556 
(2010). 

. Joughin, |., Smith, B. E. & Medley, B. Science 344, 735-738 (2014). 

. Rignot, E., Mouginot, J., Morlighem, M., Seroussi, H. & Scheuchl, B. Geophys. Res. 
Lett. 41, 3502-3509 (2014). 


DoRWNYe 


CON 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


DATA FROM L. LANDSCHiTZER 


REUTERS/ANDY CLARK 


COMMENT 


Aglorious and 13 The scientific Scientists react to Joseph L. Birman, 
gory history of New York’s WN currents in Lewis UK plan to reform research human-rights activist and 
Bellevue hospital p.354 ‘> Carroll’s words p.356 and universities p.357 physicist, remembered p.358 


Morning fog masks the view as a landscaper rakes the grass on the Vancouver Convention Center’s living roof in Vancouver, Canada. 


Carbon is not the enemy 


Design with the natural cycle in mind to ensure that carbon ends up in 
the right places, urges William McDonough. 


( "Pans. has a bad name. The 2015 
Paris climate agreement calls for a 
balance between carbon dioxide 

emissions to the atmosphere and to earth- 

bound carbon sinks’. Climate Neutral Now, 

a United Nations initiative, encourages busi- 

nesses and individuals to voluntarily meas- 

ure, reduce and offset their greenhouse-gas 
emissions by 2050. The American Institute 
of Architects has challenged the architecture 
community worldwide to become carbon 
neutral by 2030. The Carbon Neutral Cities 

Alliance, an international network of urban- 

sustainability directors, aims to slash its cities’ 

greenhouse-gas emissions by 80% by 2050. 


‘Low carbon; ‘zero carbon, ‘decarboniza- 
tion, ‘negative carbon; ‘neutral carbon even ‘a 
war on carbon — allare part of the discourse. 
If we can reduce our carbon emissions, and 
shrink our carbon footprint, the thinking 
goes, we can bring down the carbon enemy. 
It's no wonder that businesses, institutions 
and policymakers struggle to respond. 

But carbon — the element — is not the 
enemy. Climate change is the result of break- 
downs in the carbon cycle caused by us: it is 
a design failure. Anthropogenic greenhouse 
gases in the atmosphere make airborne 
carbon a material in the wrong place, at the 
wrong dose and for the wrong duration. It is 


we who have made carbon toxic — like lead in 
our drinking water or nitrates in our rivers. In 
the right place, carbon is a resource anda tool. 

Carbon dioxide is the currency of 
photosynthesis, a source of Earth’s capacity 
for regeneration. Soil carbon is the guar- 
antor of healthy ecosystems and food and 
water security. Carbon atoms are the build- 
ing blocks of life. Wool, cotton and silk are 
carbon compounds, as are many industrial 
polymers and pure ‘supercarbons’ such as 
diamonds and graphene. 

After 30 years of designing sustainable 
buildings and landscapes that manage car- 
bon, I believe it is time to breathe new 


17 NOVEMBER 2016 | VOL 539 | NATURE | 349 
© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


> life into the carbon conversation. Rather 
than declare war on carbon emissions, 
we can work with carbon in all its forms. 
To enable a new relationship with carbon, 
I propose a new language — living, dura- 
ble and fugitive — to define ways in which 
carbon can be used safely, productively and 
profitably. Aspirational and clear, it signals 
positive intentions, enjoining us to do more 
good rather than simply be less bad. 


WORDS DRIVE ACTIONS 

It is easy to lose one’s way in the climate 
conversation. Few of the terms are clearly 
defined or understood. Take ‘carbon neutral. 
The European Union considers electricity 
generated by burning wood as carbon neu- 
tral — as if it releases no CO, at all. Their 
carbon neutrality relies problematically on 
the growth and replacement of forests that 
will demand decades to centuries of com- 
mitted management’. Another strategy is to 
offset fossil-fuel use by renewable-energy 
credits — this still means an increase in the 
global concentration of atmospheric CO). 

Even more confusing is the term ‘carbon 
negative. This is sometimes used to refer to 
the removal of CO, from the atmosphere. 
For example, Bhutan’s prime minister has 
indicated that his country is carbon negative, 
because its existing forests sequester more 
CO, than the country emits and Bhutan 
exports hydroelectric power (see go.nature. 
com/2es9lgt). But aren't trees having a 
positive effect on atmospheric carbon, and 
hydroelectric power a neutral one? 

Carbon sequestration is a long-sought 
goal. It requires two elements: a way to 
capture carbon from the atmosphere or 
a chimney and a way to store it safely and 
permanently. But some so-called carbon- 
storage methods are paradoxical. For exam- 
ple, in enhanced oil recovery, CO, is injected 


into rock formations to flush out remnant 
crude oil, which is eventually burned. 

At the same time, enterprises are start- 
ing to announce their hopes to be ‘carbon 
positive’ by, for example, producing more 
renewable energy than their operations 
require, or by sequestering carbon through 
planting trees. 

Such terms highlight a confusion about 
the qualities and value of CO,. In the United 
States, the gas is classified as a commodity by 
the Bureau of Land Management, a pollut- 
ant by the Environmental Protection Agency 
and as a financial instrument by the Chicago 
Climate Exchange. 

A new language of carbon recognizes 
the material and quality of carbon so that 
we can imagine and implement new ways 
forward (see “The new language of carbo’). 

It identifies three 


“To enable categories of car- 
anew bon — living, durable 
relationship and fugitive — anda 
withcarbon,! characteristic of a 
propose anew _ subset of the three, 
language. ” called working car- 


bon. It also identifies 
three strategies related to carbon manage- 
ment and climate change — carbon positive, 
carbon neutral and carbon negative. 


START WITH THE SOIL 

How do we work with the carbon cycle to 
preserve and enhance the benefits it natu- 
rally provides? From the soil up. 

Carbon is at the heart of soil health. In 
healthy ecosystems, when plants convert 
CO, into carbon-based sugars — liquid 
carbon — some flows to shoots, leaves and 
flowers. The rest nourishes the soil food web, 
flowing from the roots of plants to com- 
munities of soil microbes. In exchange, the 
microbes share minerals and micronutrients 


LA 


A four-storey atrium with indoor and outdoor living green walls helps to provide clean air to 
Park 20|20’s Bosch Siemens Experience Centre in the Netherlands. 


350 | NATURE | VOL 539 | 17 NOVEMBER 2016 
© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


that are essential to plants’ health. Drawn 
into the leaves of plants, micronutrients 
increase the rate of photosynthesis, driv- 
ing new growth, which yields more liquid 
carbon for the microbes and more micro- 
nutrients for the fungi and the plants. Below 
ground, liquid carbon moves through the 
food web, where it is transformed into soil 
carbon — rich, stable and life-giving. This 
organic matter also gives soil a sponge-like 
structure, which improves its fertility and its 
ability to hold and filter water. 

This is how a healthy carbon cycle sup- 
ports life. This flow kept carbon in the right 
place in the right concentration, tempered 
the global climate, fuelled growth and nour- 
ished the evolution of human societies for 
10,000 years. 

Many soil researchers believe it could 
do so again. Ecologist and soil scientist 
Christine Jones, founder of the Amazing 
Carbon Project, describes the “photosyn- 
thetic bridge” between atmospheric carbon 
and liquid carbon, and the “microbial 
bridge” between plants and biologically 
active, carbon-rich soils as twin cornerstones 
of landscape health and climate restoration’. 

David Johnson at the New Mexico State 
University Institute for Energy and the 
Environment in Las Cruces has studied the 
carbon-microbial bridge*. He found that 
the most important factor for promoting 
plant growth and cultivating soil carbon was 
not added nitrogen or phosphorus but the 
carbon inputs from other plants. 


DESIGN FOR LIVING 

Let’s keep those carbon bridges open on 
all landscapes — rural and urban. Let’s use 
carbon from the atmosphere to fuel biologi- 
cal processes, build soil carbon and reverse 
climate change. Let’s adopt regenerative 
farming and urban-design practices to 
increase photosynthetic capacity, enhance 
biological activity, build urban food systems, 
and cultivate closed loops of carbon nutri- 
ents. Let's turn sewage-treatment plants into 
fertilizer factories. Let’s recognize carbon as 
an asset and the life-giving carbon cycle as a 
model for human designs. 

All designs — from products to buildings, 
cities and farms — could be carbon posi- 
tive. This may take a century, but that’s how 
long it took us to get into our current carbon 
calamity. The sooner we start, the better. By 
2030, our exuberantly urbanizing planet is 
expected to convert more habitat and farm- 
land into cities than all previous urban growth 
combined. More than 2 billion urbanites will 
live in homes, attend schools and work in 
factories that are not yet built’. Despite these 
challenges, there are models of hope. 

In 1989 my architecture firm designed 
a day-care facility in Frankfurt, Germany, 
based on ‘a building like a tree’ that could 
be operated by children, who would move 
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THE NEW LANGUAGE OF CARBON 


Too much carbon in the atmosphere is damaging. Instead, it should be retained in durable forms such as plastic and wood or in 
living organisms. Recycling materials and nurturing the soil ensure that carbon ends up in the right places in the right amounts. 


FUGITIVE CARBON 

Has ended up somewhere unwanted and can be 
toxic. It includes carbon dioxide released into the 
atmosphere by burning fossil fuels, ‘waste to 


energy’ plants, methane leaks, deforestation, much 


industrial agriculture and urban development. 
Plastic in the ocean is fugitive carbon. 


WORKING CARBON is 

put to human use. 

Working fugitive carbon | 
includes fossil fuels. | 


CO, and 
plastics 
to the 


CO, to plastics 


DURABLE CARBON 

Locked in stable solids such as coal and 
limestone, or in recyclable polymers that are 
used and reused. It ranges from reusable 
fibre, such as paper and cloth, to building 
and infrastructure elements that can last for 
generations and then be reused. 


Working durable carbon 
includes recycled and 
reused materials. 


LIVING CARBON 

Organic, flowing in biological cycles, 
providing fresh food, healthy forests and 
fertile soil. It is something we want to 
cultivate and grow. Soil includes living carbon 
in the form of fungi, microbes, humus, 
legumes and grasses. 


CO, to soil 


Agriculture uses 


working living 


carbon. 


MANAGEMENT STRATEGIES 


CARBON NEGATIVE 


Actions that pollute the land, water and 
atmosphere with various forms of carbon. For 
example, releasing methane into the atmosphere 
or plastic waste into the ocean is carbon negative. 


solar shutters, open and close windows, 
grow food on roof terraces and irrigate the 
gardens with rainwater. 

The idea of ‘buildings like trees’ and ‘cit- 
ies like forests’ endured, and we started to 
approach our product, building and city 
designs as photosynthetic and biologi- 
cally active, accruing solar energy, cycling 
nutrients, releasing oxygen, fixing nitrogen, 
purifying water, providing diverse habitats, 
building soil and changing with the seasons. 

The Adam Joseph Lewis Center for Envi- 
ronmental Studies at Oberlin College in Ohio, 
which we designed, is a built example of this 
philosophy. It purifies its waste water and 
sewage in an on-site system that produces 
carbon-rich organic compost. This year the 
project is producing solar energy at an annual 
rate of 40% more than it needs. The build- 
ing still relies on the electrical grid when 
solar energy is unavailable. Soon, with new 
and affordable on-site thermal and electric 
battery storage systems, buildings like this can 
be both carbon and energy positive. 

In the Netherlands, Park 20|20 near 
Amsterdam applies these carbon-positive 
design strategies at the campus scale. Next 
door, the Valley at Schiphol Trade Park, the 
country’s national hub for the circular econ- 
omy, will scale these and many other inno- 
vations to create an urban ecology of work, 


CARBON NEUTRAL 


Actions that transform or maintain carbon in 
durable earthbound forms and cycles for use 
across generations; or renewable energy 
such as solar, wind and hydropower that do 
not release carbon. 


supply chains and collaborative spaces. The 
development will be a network of integrated 
buildings, landscapes and technical sys- 
tems operating as a connected whole. Each 
building is oriented to the path of the Sun to 
maximize exposure during winter and shade 
during summer. Photovoltaic arrays and 
green roofs are the system's leaves and roots, 
harvesting renewable energy, absorbing and 
filtering water, producing food and provid- 
ing habitat for other living things in a vibrant, 
sustainable business community. 

The energy sector, too, can be generously 
carbon positive. SunPower, based in San 
Jose, California, and other solar providers 
are developing ‘solar orchards’ — power 
plants that perform as working farms. Rotat- 
ing arrays of elevated solar panels shade the 
earth and provide habitat for grassland, 
which captures water, nitrogen and carbon 
to build soil health, can include legumes to 
fix nitrogen, and can provide food for graz- 
ing animals, in turn providing protein and 
wool. By design, the power plant generates 
an abundance of benefits: renewable energy, 
biodiversity, food, soil restoration, nutrient 
cycling, carbon sequestration, water conser- 
vation, fibre products, and agricultural and 
manufacturing jobs. Thus working durable 
carbon creates and supports living carbon 
while reducing fugitive carbon, all in an 


Merl Phe wh wy 


Actions that convert atmospheric carbon to 
forms that enhance soil nutrition or to durable 
forms such as polymers and solid aggregates. 
Also includes the recycling of carbon into soil 
nutrients from organic materials, food waste, 
compostable polymers and sewage. 


economically robust and profitable model. 
Such designs offer an inspiring model for 
climate action. It all starts with changing the 
way we talk about carbon. Our goal is sim- 
ple and positive: a delightfully diverse, safe, 
healthy and just world — with clean air, soil, 
water and energy — economically, equitably, 
ecologically and elegantly enjoyed. = 


William McDonough is an architect 
and author specializing in sustainable 
development. He is founder of William 
McDonough + Partners and McDonough 
Innovation, and co-founder of MBDC (a 
design chemistry firm), in Charlottesville, 
Virginia, USA. He is also a founder of the 
Cradle to Cradle Products Innovation 
Institute. 

e-mail: info@mcdonough.com 
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Surgery at Bellevue hospital in New York City in the 1870s. 


PUBLIC HEALTH 


Gore and glory 


David Dobbs extols a history of New York’s Bellevue 
hospital, a crucible of discovery in medicine. 


o many Americans, the name Bellevue 
| signifies ‘psychiatric facility’ as much 
as Bedlam does to Britons. The psychi- 
atric unit of the New York City public hospital 
gained fame from the stream of cultural icons 
passing through its portals. Writer Delmore 
Schwartz arrived in handcuffs after trying to 
strangle a hostile book reviewer; jazz great 
Charles Mingus checked in voluntarily, later 
composing the song Lock ’Em Up (Hellview 
of Bellevue). Yet, as historian David Oshinsky 
shows in his sweeping, eponymous chroni- 
cle, this oldest, busiest, most storied of New 
York hospitals deserves equal recognition as 
a fount of medical discovery. 

From its beginnings as a colonial 
almshouse in 1736, Bellevue had important 
roles in one major advance after another. It 
opened the first US maternity ward (1799), 
nursing school (1873), children’s clinic (1874), 
emergency department (1876), mental-health 
ward (1879) and, in the famed Carnegie 
Laboratory (1884), pathology lab. It was the 
first US medical centre to successfully tie off 
a femoral artery, and to report tuberculosis 
as treatable. It pioneered lasting innovations 
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— ee in medical education, 
i public-health outreach 
and professional nurs- 
ing; the use of ambu- 
lances, amputation, 
anaesthesia and anti- 
sepsis; and successful 


treatment of AIDS and 

/ Ebola. It has achieved 

Bellevue: Three all this while struggling 

Centuries of to honour a pledge to 
Medicine and treat all comers. 

Mayhem at That pledge 


America’s Most 
Storied Hospital 
DAVID OSHINSKY 
Doubleday: 2016. 


guarantees a huge 
patient load, heavy 
on “undocumented, 
uninsured, and 
undomiciled” people 
shunted from other New York hospitals. 
Many arrive shattered by infectious dis- 
eases, mental illness and poverty. This 
“municipal eyesore most New Yorkers 
couldn't imagine living without” has repeat- 
edly had to ask the city to give yet more bil- 
lions, which it always does. The challenges 
that routinely almost crush the institution 


2016 


also drive its boundless creativity. 
Oshinsky draws deftly from this history 
to tell stories of an unlikely institution and 
Western medicine’s halting, bloody march 
forward. He also reveals the torturous 
ambivalence over universal patient care in 
a nation where treatment decisions remain 
largely revenue-driven. To all of it he brings 
an eye for particulars, a knack for apt quotes 
and a talent for braiding multiple themes. 
One such theme is how often medicine 
does harm because of compassionate and 
competitive pressures to do something. In 
the early nineteenth century, physicians were 
obliged to “cure quickly’, as one complained, 
“or give place toa rival”. Not even presidents 
were immune to the results of such pressure. 
In 1881, Bellevue surgeon Frank Hamilton 
inserted his dirty fingers into the wounds 
received by President James Garfield dur- 
ing an assassination attempt, probably 
contributing to Garfield’s slow, gruesome 
death. Hamilton billed the nation US$25,000 
(almost $600,000 today); he was paid $5,000. 
Oshinsky similarly enlivens Bellevue's 
more successful work in forensic pathology 
and AIDS treatment. A century ago, the hos- 
pital’s Charles Norris, tall, gruffand confident, 
provided the archetype for the no-nonsense 
medical examiner. Spending family wealth to 
stock his labs and working obsessively to solve 
mysterious deaths, Norris played a prime 
part in making forensic pathology a medi- 
cal and legal force in the United States. His 
best employee, chemist Alexander Gettler, 
became the mould for the geeky forensic ace 
of popular culture. Gettler discovered how to 
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tell whether a floating corpse had drowned 
or been tossed dead into the water, and how 
to use soil stains or thread fragments to tie a 
criminal to a crime. A fellow expert opined 
that Gettler “sent more criminals to the elec- 
tric chair through his tests than any police 
detective”. 

The AIDS chapter astonishes more grimly. 
New York, with a high incidence of intra- 
venous drug use and promiscuous unpro- 
tected sex in the late 1970s, was hit hard by 
HIV. One of the first people to be recognized 
as having AIDS presented himself in 1980 to 
Fred Valentine, an infectious-disease specialist 
at Bellevue, with the rare pneumocystis pneu- 
monia. A second case came days later. Both 
men soon died, as did eventually hundreds 
who followed them into Bellevue and nearby 
New York University Hospital. By 1985, one- 
fifth of Bellevue’ patients had HIV. At first, 
these hospitals, with others in San Francisco, 
California, and the Centers for Disease Con- 
trol and Prevention in Atlanta, Georgia, 
fought a seemingly futile effort to contain this 
accelerating epidemic, even as cases began to 
be recognized in Africa and on other conti- 
nents (T. Tansey Nature 533, 468; 2016). 

Oshinsky makes vivid the piteous 
condition of early patients, and caretakers 
struggles with fear of contagion. Bellevue 
staff’s sense of “therapeutic impotence’, he 
shows, finally gave way to exhausted relief in 
the mid-1990s. Teams led by Valentine and 
fellow medic David Ho separately devel- 
oped similar antiretroviral (or ART) drug 
cocktails that inhibited the virus’s replication 
and made AIDS manageable. 

Bellevue’s record shines bright amid 
the shameful behaviour of many in the US 
medical system around HIV and AIDS. In 
1986, with more than 20,000 dead and tens 
of thousands more infected, the American 
Medical Association informed its members 
that it was legitimate to ignore these patients. 
The United States, Oshinsky notes, continues 
to ration care according to status and wealth 
— an ailment that the Obamacare medical- 
insurance statute salves but does not cure. 
This year, for instance, conservatives in the 
US Congress repeatedly sabotaged bills 
meant to fund the fight against the expand- 
ing Zika epidemic which, up to that point in 
the country, had affected mainly the US ter- 
ritory of Puerto Rico. The funding, requested 
by President Barack Obama's administration 
in February, finally passed on 28 September. 

Bellevue is rich in anecdote, history, 
personality and narrative. It is also an indict- 
ment of a society’s failure, almost 300 years 
on, to extend medical benefits to all, despite 
the efforts of this extraordinary hospital. = 


David Dobbs writes on science, medicine 
and culture. His books include Reef Madness 
and the short memoir My Mother's Lover. 
e-mail: david.a.dobbs@gmail.com 
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Books in brief 


Beyond Earth: Our Path to a New Home in the Planets 

Charles Wohiforth and Amanda R. Hendrix PANTHEON (2016) 

“They will go boating on lakes of liquid methane and fly like birds 
in the cold, dense atmosphere.” Life on Saturn’s moon Titan could 
prove exhilarating, suggest writer Charles Wohlforth and planetary 
scientist Amanda Hendrix in this synthesis of space-colonization 
science. Their thought experiment balances possible futures with 
a raft of facts on advances in spacecraft technology, robotics and 
space medicine. Crucially, they parse the push and pull between 
cautious governments and gung-ho entrepreneurs, concluding that 
the two may ultimately add up to a propulsive combination. 


A Wretched and Precarious Situation 

David Welky W. W. NORTON (2016) 

In 1906, polar explorer Robert Peary sighted a mysterious region 
northwest of Canada’s Arctic Archipelago. Dubbing it Crocker Land, 
he enlisted anthropologist Donald MacMillan and geologist Elmer 
Ekblaw as part of a US-Inuit team to explore the landmass under 
the aegis of New York’s American Museum of Natural History. What 
happened next, historian David Welky reveals in this engrossing 
account of the five-year effort (1913-17), involved not only the 
classic litany of illness, privation and howling blizzards, but a 
singularly bizarre finding about Peary’s original sighting. 


The Unnatural World 

David Biello SCRIBNER (2016) 

Forget the Anthropocene epoch, argues journalist David Biello. What 
we need is a broader “Anthropozoic” era, a lasting future anchored 
in inspired planetary stewardship and intelligent optimism. His lucid 
survey of researchers straining to contain today’s global environmental 
shifts (from climate change to dwindling marine biodiversity) mixes 
their personal scientific journeys with contextualizing discoveries. 
Among them are marine biologist Victor Smetacek, experimenting 
with iron fertilization to draw carbon dioxide into the oceans, and 
ecologist Erle Ellis, monitoring forestation with drones. 


Where Poppies Blow: The British Soldier, Nature, the Great War 
John Lewis-Stempel W&N (2016) 

For traumatized, trench-bound British soldiers caught up in the 
carnage of the First World War, birdwatching and botany offered 
solace. So reveals John Lewis-Stempel in this riveting study drawing 
on verse, letters and field notes by men who served, from zoologist 
Dene Fry to poet Edward Thomas. He shows how observing the 
nesting larks that twittered above ‘no man’s land’ and natural 
cycles such as the seasons gave a sense of renewal, and how animal 
‘troops’ were inspirationally loyal. A remarkable picture of a human 
bloodbath that took place amid phenomenally rich biodiversity. 


The Secret Lives of Colour 

Kassia St Clair JOHN MURRAY (2016) 

Heliotrope, gamboge, umber: the names of colours are as luscious 
as the hues themselves. Kassia St Clair serves up a chromatic 

buffet of the chemistry, history and cultural associations of 75 dyes, 
pigments and shades, including encapsulations of optics and colour 
theory. The gorgeous, malodorous Turkey red, for instance, was 
made in a “tortuous process involving rancid castor oil, ox blood 
and dung”, and the Roman emperor Nero used a large emerald as 
“proto-sunglasses” while watching gladiatorial combat. Barbara Kiser 
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Untangling Alice 


Gillian Beer reveals the currents in Lewis Carroll’s worlds. 


ewis Carroll's lucid dreams draw in 
Le fresh contexts for interpreta- 

tion as they continue to delight and 
disturb. How did Charles Dodgson — 
Carroll’s real name — do it? The story used 
to be that this ‘mediocre’ English mathema- 
tician, isolated at Christ Church College, 
Oxford, somehow miraculously produced 
Alices Adventures in Wonderland (1865) 
and Through the Looking-Glass, and What 
Alice Found There (1871). The unlikely tale 
persisted, perhaps because so little evidence 
seemed to have survived of Dodgson’s read- 
ing; at his death, most of his books were 
hastily sold off. 

But recent years have seen the publica- 
tion of Edward Wakeling’s The Diaries of 
Lewis Carroll (The Lewis Carroll Society, 
1993-2007) and Charlie Lovett’s Lewis Car- 
roll Among His Books (McFarland, 2005) — 
a descriptive catalogue of the volumes that 
Dodgson owned or is known to have read. 
These have given me convincing external 
evidence of the range of his curiosity: across 
evolution and logic, dream theory and 
botany, mathematics and dictionaries, ani- 
mal welfare and new ideas about language. 
I have also explored his fascination with 
parody and the satirical weekly magazine 
Punch. In my new book, Alice in Space 
(University of Chicago Press, 2016), I inves- 
tigate the array of debates to which Carroll 
playfully responded, and so find lost jokes 
and poignant vanished references. 

When I started writing it more than a 
decade ago, I wondered how far intuition and 


the layers of his jokes, sliding infant puns 
above learned references. He had a teasing 
openness to the ideas being pursued by his 
contemporaries in science, such as Thomas 
Henry Huxley, Charles Darwin, by logician 
John Stuart Mill and by lawmakers, pho- 
tographers, museum-makers and novelists 
including George Eliot and Emily Bronté. 
The Alice books move like quicksilver — 
sleek globules of sense splitting and straying 
through the maze. This was a period when 
possible worlds, probability and ideas of 
space as curved or flat were being eagerly 
discussed. What mathematician James 
Joseph Sylvester described as the “rumpling 
of the page” of 3D space fascinated, among 
others, physicist Hermann von Helmholtz, 
who observed that Euclid’s geometry holds 
good for the plane, but not for “surfaces flex- 
ible without change of dimensions”. Thus 
Alice goes through the looking-glass. 
Carroll’s taste for games and play was 
shared by many of his contemporaries, and 
understood as essential intellectual stimulus. 
For instance, Sylvester’s 1869 address to the 
British Association for the Advancement of 
Science emphasized the need to quicken the 
mind of students “with the doctrine of the 
imaginary and the inconceivable”. The logi- 
cian Augustus de Morgan wrote in 1859 “All 
that is thinkable is possible; all that is impossi- 
ble is unthinkable: that is, so far as our knowl- 
edge can go.” Carroll, who knew both men, 
put it this way in an encounter between Alice 
and the White Queen in Looking-Glass: 


familiarity with Victorian intellectual cul- ' oN 


ture should take me in asserting Carrol’s A/" hg 
aN x \ things.” 


ly 


participation in the ideas throng- 2, | y 

ing around him. I had to rely on 3 De 
the Alice books for evidence = i 
of allusion and parodies. Now 


I have a fuller picture of how 3 wx. ea Sy; 


Carroll used fantasy to pursue $e 
thoughts — on radical math- | 
ematics and Boolean logic, for #8 lll: 
example — that he constrained ' 
in his professional life as a devout 
Euclidean (F. E Abeles Nature # 
527, 302-304; 2015). # 
The Victorian culture within 
which the Alice books were i aa 
written is largely invisible to us 
now. It was a period of immense 
intellectual upheaval in fields 7 
from mathematics to language 
theory, evolution and educa- re 
tion. Carroll slips these ideas into 
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Alice laughed. “There’s no use trying,” 
she said: “one cart believe impossible 


“T daresay you haven't had much 
practice; said the Queen. “When I was 
your age, I always did it for half-an-hour 
a day. Why, sometimes I’ve believed as 
many as six impossible things before 
breakfast” 


The idea of rigorous training in 
believing the impossible nicely 

iN tilts at Victorian learning ritu- 
i als in the style of Gradgrind, 
Charles Dickens’s rigid peda- 
gogue in Hard Times (1854). 
Nonsense was also valued by 
the physicist James Clerk Max- 
well, who appreciated both the Alice 
books. His 1873 
poem ‘Molecular 
evolutiom declared: 


One of Alice’s 
transformations. 


What combination of ideas, 

Nonsense alone can wisely form! 

What sage has half the power that she has, 
To take the towers of truth by storm? 


Carroll put several systems in motion at 
once in his scenes, to produce new absurdi- 
ties and insights. For example, the names of 
the snooty talking flowers in Looking-Glass 
parody the then-popular 1855 poem ‘Come 
into the garden, Maud, and the fashionable 
‘language of flowers’ for lovers. They also 
nod to a recent theory about the origin of 
language in animal sounds that Carroll’s 
friend, the philologist Max Miiller, scorned 
as ‘the bow-wow theory’: 


“But what could it do, ifany danger came?” 
Alice asked. 

“Tt could bark, said the Rose. 

“It says ‘Bough-wough!” cried a Daisy. 
“That's why its branches are called boughs!” 


Alice, eager and adamant, is also a trans- 
formative figure. In Wonderland she can, 
after trial and error, manage the scale of her 
body: wish fulfilment for any child subject 
to the awkward demands of growing up. 
And the often improbable relation of child- 
hood appearance to the later adult was being 
newly dramatized now that they could be set 
side by side in photographs. Transformation 
was a perturbing idea for many Victorians, 
with its threat of kinship between species (in 
the books, a baby turns into a pig; bottles, 
plates and forks become birds). All taxo- 
nomic systems seemed unstable in the wake 
of evolutionary theory — to say nothing of 
the threat to human exceptionalism. After 
Alice’s neck grows, she encounters a pigeon 
defending its eggs. Hearing that little girls 
eat eggs, it announces, “then they’re a kind 
of serpent”. 

Carroll emphasizes in his sideways stories 
that everything has a voice and every thing 
aright to speak, be it queen, cat, door, Time, 
Mock Turtle, egg or child. The Unicorn in 
Looking-Glass remarks of Alice: 


“T always thought they were fabulous mon- 
sters!” said the Unicorn. “Is it alive?” 

“Tt can talk,” said Haigha solemnly. 

The Unicorn looked dreamily at Alice, and 
said: “Talk, child” 


Talk is the stuff of life in these books: deft 
and provoking talk that draws on an array of 
then-current ideas and yet holds steady the 
surface tension of the story. m 


Gillian Beer is emeritus professor at the 
University of Cambridge, UK, and an 
honorary fellow of Clare Hall and Girton 
Colleges, Cambridge, and St Anne’ College, 
Oxford. Alice in Space is published this month. 
e-mail: gpb1000@cam.ac.uk 
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UK research reform: 
protest now 


The grass-roots campaign 

group Science is Vital (http:// 
scienceisvital.org.uk) shares the 
Royal Society president's concerns 
over aspects of the government's 
Higher Education and Research 
Bill (see V. Ramakrishnan Nature 
538, 459; 2016). 

The bill does not yet offer 
sufficient protection for the 
operational autonomy of the 
UK research councils. Neither 
does it provide legal guarantees 
that future reforms will take into 
account the views of Parliament 
or the research community. 

We are also alarmed by 
mechanisms the bill could use to 
aid entry and exit of institutions 
from the higher-education 
‘market: For example, the new 
Office for Students will have the 
power to revoke university status 
without parliamentary assent. 
Such powers undermine support 
for the autonomy of seats of 
learning and enquiry that have 
proved their cultural worth for 
generations. 

In our view, verbal assurances 
from the Minister for Universities 
and Science are insufficient. The 
stated “primacy of scientific and 
academic decision-making” must 
be enshrined in the bill. 

As the bill is readied for its third 
reading before being debated in 
the House of Lords, we call on 
the UK research community to 
contact their MPs urgently to 
relay concerns about the dangers 
in this proposed legislation (see 
go.nature.com/2fsmclc). 
Stephen Curry Imperial College 
London, UK. 

Jenny Rohn University College 
London, UK. 

Andrew Steele Francis Crick 
Institute, London, UK. 
s.curry@imperial.ac.uk 


UK research reform: 
poor timing 
Science and universities in the 


United Kingdom surely do not 
need a major and controversial 


restructuring during the 
approach to Brexit (see Nature 
538, 5; 2016). The government’s 
proposed Higher Education 
and Research Bill stands to 
erode university autonomy, 
downgrade individual research 
councils and concentrate 
executive authority over science 
into a single ‘supremo. 

It is plainly desirable for 
the research councils to 
collaborate more smoothly. 
And ministers need better 
advice on apportioning 
funding between councils and 
on such matters as balancing 
small-scale, ‘responsive 
mode’ grants against large 
strategic initiatives. However, 
these inadequacies can be 
remedied without the wholesale 
reorganization envisaged in the 
bill. From my perspective as 
Astronomer Royal and former 
Royal Society president, the 
research councils work better 
than most government agencies 
and need only fine-tuning. 

A good start would be to 
ensure that there is a senior 
independent voice in Whitehall 
by reviving the post of Director- 
General of Research Councils, 
supported by a strong advisory 
board. 

Martin Rees Institute of 
Astronomy, University of 
Cambridge, UK. 
mjr@ast.cam.ac.uk 


UK research reform: 
get the facts straight 


For universities with a royal 
charter, the UK government's 
proposed Higher Education 
and Research Bill does not “rip 
up an 800-year-old settlement” 
(Nature 538, 5; 2016). And on 
a factual point, the University 
of Cambridge does not have a 
royal charter. Pope John XXII 
gave us formal recognition in 
1318, and our privileges were 
confirmed by Parliament in 
1571 through the Oxford and 
Cambridge Act. 

A royal charter recognizes 
an institution or group of 


individuals as a single legal 
entity, each with different 
responsibilities and rights. It is an 
exaggeration and simplification 
of the bill’s proposals to say that 
it will revoke these. Instead, 
the bill would legally recognize 
institutional autonomy, the 
principle of dual support and the 
sector's diversity. 
Evidence-based debate will 
improve the proposed legislation, 
for example through the Green 
Paper consultation and the 
Public Bill Committee (see 
also go.nature.com/2ejk7km). 
Discussions will continue as 
part of standard parliamentary 
procedure. The bill does not 
need to be thrown out to protect 
academic freedom. 
Leszek K. Borysiewicz 
University of Cambridge, UK. 
v-c@admin.ac.uk 


Genetics boosts 
US- Cuban links 


As economic doors open 
between the United States and 
Cuba, human genetics offers 
one promising area for scientific 
collaboration (see Nature 537, 
600-603; 2016). Reversing 

50 years of restriction remains a 
formidable task — particularly 
with scant financial and human 
resources. 

Community genetics is 
incorporated into Cuba’s health- 
care system. As in the United 
States, prenatal genetic testing, 
screening of newborns and 
clinical genetics services are all 
available. Comparative studies 
on US and Cuban populations 
could help to clarify the genetic 
contribution to disease — for 
example, by revealing rare 
inherited genomic variants. 

Miami is one of the best- 
positioned US cities to lead such 
scientific partnerships, given its 
social and cultural ties with Cuba 
(Florida is home to 66% of the 
US Cuban population). At the 
University of Miami’s Leonard 
M. Miller School of Medicine, we 
are working to develop exchange 
programmes with medical 


institutions in Cuba. These 
mutual learning opportunities 
should foster multidisciplinary 
research partnerships in the next 
generation of medical geneticists. 
Rosario Isasi, Stephan L. 
Zuchner University of Miami, 
Florida, USA. 

Roberto Cafiete Villafranca 
University of Medical Sciences, 
Matanzas, Cuba. 
risasi@miami.edu 


Precision oncology 
is not an illusion 


In our view, it is unreasonable to 
condemn personalized medicine 
for oncology on the basis of the 
limited success of a few trials 
(see V. Prasad Nature 537, S63; 
2016). We suspect that those 
failures were more likely to 

be caused by shortcomings in 
methodology. 

With more than 40 precision- 
oncology drugs on the market, 
such therapies are helping tens 
of thousands of US patients by 
targeting specific molecular 
abnormalities. For example, 
mutations in the gene that 
encodes the epidermal growth- 
factor receptor are likely to occur 
in 10% of the 186,240 or so new 
cases of non-small-cell lung 
cancer predicted for 2016 (see 
go.nature.com/2fpxits). And 
the 8,220 people with chronic 
myeloid leukaemia (CML) 
predicted for this year will 
almost all carry the ‘Philadelphia 
chromosomal translocation (see 
go.nature.com/2fbbarj). 

There are precision-oncology 
drugs to counteract both defects. 
Indeed, the life expectancy of 
people with CML is now starting 
to approach that of the general 
population (S. Saussele et al. 
Blood 126, 42-49; 2015). 
Edward Abrahams Personalized 
Medicine Coalition, Washington 
DG, USA. 

Stephen L. Eck* Astellas, 
Northbrook, Illinois, USA. 
eabrahams@ 
personalizedmedicinecoalition.org 
*Competing interests declared; see 
go.nature.com/2efcnnj. 
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OBITUARY 


Joseph L. Birman 


(1927-2016) 


Physicist who helped hundreds of scientists to escape oppression. 


He attended an unofficial seminar in 

Moscow for scientists like me who had 
lost access to academic institutions in the 
Soviet Union because of our political views 
or because we had applied to leave the 
country. We gathered in the apartment of 
a computer scientist who was under KGB 
surveillance (and later spent five years in 
prison and exile for anti-Soviet actions). This 
was before the Internet and social media. 
National borders were closed; we felt isolated 
and threatened by our government. 

Although few Western academics dared 
attend our gatherings, there were many more 
people than chairs. Birman, a tall man, sat 
uncomfortably on the floor, trying to find 
space for his legs. Then, as always, he was 
talkative and cheerful: everybody’s uncle. 

Birman, who died on 1 October, was 
born in New York City on 21 May 1927, the 
grandson of Jewish immigrants from Rus- 
sia. In 1943, he graduated from the Bronx 
High School of Science, famously an incuba- 
tor of prominent researchers. He received a 
bachelor’s in science from the City College 
of New York and a doctorate in theoretical 
physics from Columbia University in 1952, 
going on to work on the optical properties 
of semiconductors at GTE Laboratories in 
New York. A decade later, he become a pro- 
fessor at New York University, and in 1974, 
he joined the faculty of City College, where 
he remained until his death. 

Sharply experimental in his thinking and 
prohibitively mathematical, Birman dem- 
onstrated how the branch of mathematics 
known as group theory can be applied to 
understand transitions between crystal 
phases and to predict light scattering and 
other optical properties of solids. 

He leveraged the respect he gained from 
seminal papers in the 1960s and 1970s into 
advocacy for hundreds of scientists. In a 
letter endorsing Birman for the Andrei 
Sakharov Prize of the American Physical 
Society (APS), which recognizes scientists 
who fight for human rights, Iranian physi- 
cist Hadi Hadizadeh wrote: “His efforts to 
get me released from detention and solitary 
confinement in 2001 will not be forgotten by 
me, my family, and many scientists world- 
wide.” Winning the award in 2010, Birman 
was delighted to see his name attached 
to that of the notable Soviet dissident and 
nuclear physicist. 

Birman’s trips to the Soviet Union began 


| Ez met Joseph L. Birman in 1979. 


in the 1970s with official invitations from the 


Soviet Academy of Sciences. During those 
trips he learned about the plight of Jewish sci- 
entists in the country. They were often denied 
promotion, travel abroad and positions at top 
research institutions. Applying for an exit visa 
frequently resulted in loss of employment, but 
rarely in permission to leave. Open protests 
led to arrests and imprisonment. 

Birman used his travels and eminence to 
challenge the heads of Soviet research insti- 
tutions on behalf of scientists caught in this 
plight. It is thanks to the efforts of him and 
his colleagues that I did not end up in jail, 
despite multiple KGB interrogations, and 
was finally allowed to leave the Soviet Union. 

In the early 1990s, when many scientists 
in Russia were finally allowed to emigrate, 
Birman helped to establish the Program for 
Refugee Scientists in the United States, rais- 
ing funds from private foundations. This 
supported visiting positions for more than 
ahundred émigré scientists in US universi- 
ties and gave them time to secure permanent 
positions in industry and academia. 

Birman played a crucial part, along 
with particle physicist Robert Marshak, 
in recovering a generation of Chinese 
physicists lost to Mao Zedong’s cultural 
revolution in the 1960s and 1970s. Dur- 
ing this time, most scientific research 


358 | NATURE | VOL 539 | 17 NOVEMBER 2016 
© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ceased, concepts such as Einstein’s theory of 
relativity were denounced as bourgeois and 
scientists were sent to do manual labour 
in the countryside. In 1983, Birman and 
Marshak travelled to Beijing on behalf of 
the APS and signed an agreement with the 
Chinese Academy of Sciences and the Min- 
istry of Education that brought more than 
60 middle-aged physicists to work in labs 
throughout the United States for up to three 
years. Many leaders of Chinese physics are 
alumni of that programme, and the scientific 
cooperation between Chinese and US physi- 
cists that now exists evolved largely from it. 

When the programme came to an end 
in the tragic aftermath of the Tiananmen 
Square protests in 1989, Birman redirected 
his efforts to achieve justice for Chinese 
scholars who openly spoke their minds. 
He would get phone calls and even surprise 
visits from Chinese scientists. He welcomed 
and did everything in his power to help these 
people, counselling them on how to man- 
age their careers and providing contacts and 
recommendations. 

Birman chaired human-rights commit- 
tees at the APS and the New York Academy 
of Sciences and in that capacity wrote hun- 
dreds of letters to heads of governments, 
kings and religious leaders. He publicized 
cases of unjustly imprisoned scientists. For 
more than 40 years, he served as vice-chair 
of the Committee of Concerned Scientists 
dedicated to protecting human rights and 
scientific freedom around the world. 

Joe met Joan Sylvia Lyttle when they were 
both at graduate school. They married in 
1950 and had three children. She became 
a professor of mathematics at Columbia 
University. The day before Joe died, he and 
Joan had spent hours discussing a potential 
overlap between her work on topology and 
his model of how the phase of a particle is 
influenced by its trajectory through space. 

As much as Joe loved physics, getting a 
scientist out of prison had infinitely greater 
value to him than any scientific achieve- 
ment. As a physicist and a humanitarian, 
Joe's influence touched so many lives. He will 
be dearly missed by his friends and remem- 
bered by hundreds of people he helped. m 


Eugene M. Chudnovsky is a distinguished 
professor of physics at the City University of 
New York and co-chair of the Committee of 
Concerned Scientists. 

e-mail: eugene.chudnovsky@lehman.cuny.edu 
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OPTICAL PHYSICS 


Clear directions for random lasers 


Random lasers use disordered structures to produce light, which is usually emitted in many directions. A random laser 
that can produce a collimated beam offers a wide range of applications, from imaging to security scanning. 


DIEDERIK S. WIERSMA 


special class of laser called a random 

laser is cheap, easy to fabricate and 

has emission properties that, to a cer- 
tain extent, resemble those of a regular laser’. 
However, light from a random laser is usually 
emitted in an uncontrolled way, over a broad 
range of angles (Fig. 1a)”. Writing in Optica, 
Schonhuber et al.’ report a breakthrough in 
this respect — namely, the realization of a 
random laser whose directionality of emis- 
sion can be controlled. The authors even cre- 
ate a collimated beam, while at the same time 
maintaining the rich spectral behaviour that 
characterizes random lasers. 

The essential difference between regular 
and random lasers is the way in which the 
‘optical modes’ are formed. In a regular laser, 
light is typically reflected back and forth 
between two parallel mirrors, one of which is 
partially transparent so that some of the light 
can get through. If the distance between these 
mirrors matches the wavelength of the light (or 
a multiple of it), a resonance will occur — just 
like a violin string that emits sound at a wave- 
length that matches its length. Optical modes 
correspond to the wavelengths or, equivalently, 
frequencies of light at which these resonances 
exist. We now know that even the most com- 
plex disordered photonic structures exhibit 
such resonances, which allows their optical 
response to be described in terms of modes’. 
A wealth of these disordered structures is 
currently used for light generation by 
random lasers. 

For a pair of parallel mirrors, the spatial 
distribution of light at the resonance frequen- 
cies is simple: it consists of a series of maxima 
and minima along a straight line. By contrast, 
the corresponding light distributions for dis- 
ordered photonic structures are complex and 
can differ in nature, ranging from being highly 
spatially localized to extending over the entire 
structure of interest’. Depending on the type 
of structure, the number of optical modes can 
vary from a few to very many, each having a 
characteristic light-intensity spectrum. Under 
normal conditions’®, the emission of a random 
laser is the result of a delicate interplay of all 
of its modes and their corresponding spectra. 

Schonhuber and colleagues managed to 
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Figure 1 | A collimated beam froma random laser. a, In a random laser, light is amplified as it is 
transferred between particles in a disordered photonic structure. The light from a random laser is usually 
emitted in an uncontrolled way, over a broad range of angles. b, Schénhuber et al.’ sandwiched the 
photonic structure between a layer of metal and a planar waveguide, which restricted the propagation of 
light to a plane. The authors then inserted several small holes in the waveguide at random positions. By 
carefully optimizing the design of the waveguide and the properties of the light, the authors produced a 


collimated laser beam. 


produce collimated light from a random 
laser by making cunning use of the laser’s 
optical modes. The authors used a planar 
waveguide — a structure that restricts the 
propagation of light to a plane — in which 
several small holes had been inserted in 
random positions (Fig. 1b). This type of 
configuration has two main advantages over 
previous studies*®. First, the in-plane scatter- 
ing of light from the holes produces disordered 
modes whose spectra have varying spatial 
distributions. Second, the holes allow a tiny 
fraction of the light to scatter out of the plane 
and be emitted from the laser. 

By carefully adjusting parameters such as 
the density and distribution of holes in the 
waveguide, Schénhuber and collaborators 
achieved the optimal in-plane confinement 
of light, while ensuring a sufficient amount of 
emission out of the plane. The same type of 
configuration had previously been proposed 
for enhancing the efficiency of thin-film solar 
cells’, which is perhaps not surprising because 
good absorbers of light can also be used as effi- 
cient emitters. 

Next, Schonhuber et al. adjusted the polari- 
zation and phase of the laser light to obtain a 
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beam that was perpendicular to the waveguide 
and almost perfectly collimated. This concept 
had already been explored theoretically* and 
has now been put into practice. At the same 
time, the authors maintained the advantages 
that come with random photonics: ease of 
fabrication, miniaturization and emission of 
photons that are coherent (in-phase) and that 
have a wide range of wavelengths. 

Previous studies that produced light from a 
random laser using a disordered array of holes 
in a semiconductor membrane were limited 
to emissions in the mid-infrared range’ (fre- 
quencies of a few tens of terahertz; 1 THz is 
10? Hz). But Schénhuber and colleagues’ ran- 
dom laser can produce light at longer wave- 
lengths (frequencies of a few THz), which is 
ideal for applications in areas as diverse as 
biomedical imaging, environmental monitor- 
ing and security scanning. The authors’ device 
could also be used in manufacturing because 
common packaging materials such as card- 
board and plastics are transparent to terahertz 
radiation”. 

However, one should bear in mind that 
using random optical modes also means 
that the emission spectra will be random. 


Fortunately, the spectra produced by the authors 
are stable over time, and reproducible because 
there is limited competition between the 
different modes. But the fine structures of these 
spectra are disordered and contain a random 
collection of spikes in intensity. Using a slightly 
different waveguide produces a spectrum that 
has a distinct fine structure. Therefore, truly 
broadband emission, which includes light of 
all wavelengths within a given bandwidth, could 
be obtained only by putting together a large 
collection of such random lasers. 

The intrinsic disordered nature of random 
laser emission could, however, be turned into 
an advantage. A large array of miniature emit- 
ters could serve as a light source with an intri- 
cate spectrum, the intensity spikes of which 
could be switched on and off. There is also 
an opportunity for applications in the field of 
security, in which the unique fine structure of 
the individual spectra could be used for tag- 
ging objects. One example is vehicle recogni- 
tion (both military and civilian). The authors’ 


MOLECULAR BIOLOGY 


collimated beam, combined with its detailed 
and unique spectrum, could be used to identify 
vehicles at large distances that would otherwise 
be invisible to the human eye and undetect- 
able using electronic equipment. The complex 
emission spectra and detailed fine structure 
would also have advantages over regular, 
single-frequency laser light in distinguishing 
weak signals from background noise — for 
instance, in imaging applications. 

Random lasers are becoming more ordered 
and regular lasers are benefiting from disor- 
der. Where is the field going? Random lasers 
have turned out to be less mysterious than 
initially thought, and we have gained a much 
better understanding of their rich and complex 
behaviour. The enormous range of materials 
that have been shown to exhibit random laser 
emission could suggest that random lasers will 
one day be more common than their regular 
counterparts. The clever use of disordered 
optical modes will help us to take advantage 
of the fascinating properties of random lasers 


Mature proteins braced 
by a chaperone 


Hsp70 chaperone molecules help other proteins to fold, and were thought to bind 
mainly to unfolded proteins. Single- molecule experiments now suggest that 
Hsp70s can also stabilize almost fully folded proteins. SEE LETTER P.448 


QINGLIAN LIU & ELIZABETH A. CRAIG 


( ‘on function depends not only on the 
proper folding and localization of pro- 
teins when they are synthesized, but 

also on maintaining the well-being of fully 

folded (mature) proteins, and on their dis- 
posal when they are irretrievably damaged’. 

Seventy-kilodalton-sized heat-shock pro- 

teins (Hsp70s) are a major class of ubiquitous 

molecular ‘chaperones’ that have key roles in 
maintaining protein homeostasis”’. Hsp70s 
interact with short stretches of amino-acid 
residues in extended conformations, such as 
those in unfolded proteins’, and thus prevent 
such proteins from misfolding and aggregat- 
ing. But on page 448, Mashaghi et al.” report 
that Hsp70s are more flexible in their binding 
ability than was previously appreciated: they 
also interact with intermediates in folding 
processes, and with nearly fully folded pro- 
teins. This suggests that Hsp70s play a bigger 
part in protein homeostasis than was thought. 

Analysing how protein folding is facilitated 
by molecular chaperones has been challeng- 
ing — particularly for those chaperones, such 
as Hsp70, for which nucleotide binding and 


hydrolysis drive cycles of interactions with 
proteins. Such chaperone-protein interactions 
are transient, and the conformations adopted 
by the substrate proteins are often highly vari- 
able and change over time. This is particularly 
true for the intermediates formed during 
protein folding. 

Single-molecule analytical techniques 
can overcome these challenges, and thus are 
powerful approaches for addressing chaperone 
function. For example, the force generated by 
optical tweezers — a laser device that can hold 
and move microscopic objects — is ideal for 
pulling apart the folded structures of mature 
proteins to produce fully or partially unfolded 
proteins. Subsequent relaxation of the pull- 
ing force enables the refolding process to be 
observed. Chaperones, co-chaperones (pro- 
teins that assist chaperones in their functions) 
and nucleotides can be added to the molecule 
being studied to probe their effect on protein 
refolding and stability. 

Mashaghi et al. used optical tweezers 
to analyse how DnaK, an Hsp70 from the 
bacterium Escherichia coli, interacts with 
maltose-binding protein. Surprisingly, they 
found that DnaK binds to and stabilizes nearly 
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and tame their behaviour to the extent that is 
needed for practical applications. m 


Diederik S. Wiersma is at the European 
Laboratory for Non-linear Spectroscopy 
(LENS), University of Florence, 50019 Sesto 
Fiorentino, Italy, and the National Institute for 
Metrological Research, Turin, Italy. 

e-mail: wiersma@lens. unifi.it 
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fully folded and partially folded proteins, in 
addition to unfolded proteins. These results 
suggest that Hsp70s not only assist the folding 
of unfolded and misfolded proteins, but also 
help to maintain the conformation of mature 
proteins — thus maintaining their function. 
This preventive maintenance for ‘healthy’ 
mature proteins has not previously been 
suggested as a role for Hsp70 chaperones. 

Although examples of Hsp70 binding to 
mature proteins have been reported before, 
the chaperone was thought to interact with 
extended loops in proteins*. Such thinking 
is consistent with structural studies** that 
revealed Hsp70’s peptide-binding cleft to be 
so constrained that only an extended, unstruc- 
tured polypeptide segment of a protein can be 
accommodated in it. The substrate-binding 
domain (SBD) of Hsp70 is dedicated to binding 
polypeptide substrates, and comprises about 
half of the chaperone molecule. A subdomain 
of the SBD, known as SBD, contains the 
peptide-binding cleft, and another subdomain, 
SBDa, forms a lid over the bound peptide 
segment (Fig. 1). SBDa thus stabilizes the 
chaperone-substrate interaction, but without 
forming any substantial direct contact with the 
substrate. 

This model of substrate binding has provided 
a satisfying mechanistic explanation for how 
Hsp70 aids protein folding, especially the 
early steps in which nascent proteins consist of 
unfolded chains. Mashaghi et al. now find that 
the SBDa lid is particularly important in stabi- 
lizing almost-mature protein substrates — an 
intriguing observation that raises the question 
of how Hsp70s bind to such substrates to 
increase their stability. A structure of an Hsp70 
in complex with a mature substrate would help 
to provide insight into the mechanics of this. 

The substrate-binding activity of Hsp70s 
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Figure 1 | Different roles of Hsp70s in protein folding. a, Seventy-kilodalton heat-shock proteins 
(Hsp70s) are molecular ‘chaperones’ that were thought to mainly assist the folding of unfolded, nascent 
protein chains. Unfolded proteins can bind to a cleft in a region of Hsp70 known as substrate-binding 
domain-$ (SBD), and another region (SBDa) forms a lid that closes over the unfolded protein chain. 
The nucleotide-binding domain binds to ATP, which can be hydrolysed to form ADP. The lid is 
predominantly open when ATP is bound and closed when ADP is bound. b, Mashaghi et al.” report 
single-molecule experiments with an Hsp70, and find that it interacts with an almost fully folded protein. 
This suggests that Hsp70s have roles in maintaining the structures of almost fully folded proteins. 


depends on which nucleotide is bound’: 
either ATP or the product of ATP hydrolysis, 
ADP. When ATP is bound, the peptide-bind- 
ing pocket is predominantly open, allow- 
ing easy, rapid access for substrates. When 
ADP is bound, the peptide-binding pocket is 
predominantly closed — reducing substrate 
access but enhancing the stability of the chap- 
erone-substrate interaction after the substrate 
is bound. The ATP-bound state therefore prob- 
ably binds large, mostly folded substrates much 
more readily than the ADP-bound state. Both 
ATP binding and hydrolysis are crucial to the 
conformational changes of Hsp70 that allow 
it to bind and release substrates as they fold. 

It is therefore surprising that Mashaghi and 
colleagues find that Hsp70 is more effective in 
stabilizing the (nearly) mature conformation 
of maltose-binding protein when ADP is the 
predominant nucleotide present, rather than 
ATP. However, the authors note that the ratio 
of ADP to ATP in cells is high during heat 
stress, and therefore propose that the observed 
stabilization is beneficial to the cell under such 
conditions — it might provide a way of main- 
taining proteins in their functional, folded 
conformations for little expenditure of energy. 
This explanation makes sense, but more work 
is needed to understand the physiological rel- 
evance of the results. 

Mashaghi and co-workers’ study provides 
a wealth of information, but the limitations 
of the analysis should be kept in mind. The 
authors tested a single Hsp70: DnaK, a well- 
studied model of these chaperones. But even 
though Hsp70s are highly evolutionarily 
conserved, there are substantial differences 
in both their amino-acid sequences and their 
specific cellular functions. Does the observed 


stabilization of folded protein substrates also 
occur in the Hsp70s of eukaryotic organisms 
(which include plants, animals and fungi)? 
Moreover, Hsp70 activity in protein folding 
depends on the cooperation of two types of 
co-chaperone: J-proteins and nucleotide- 
exchange factors (NEFs)’*’’. These co- 
chaperones, especially NEFs, have much 
more diverse functions and amino-acid 
sequences than Hsp70s, so how do they fit 
into this expanded view of Hsp70 function? 
And most of Mashaghi and colleagues’ single- 
molecule assays involved just one kind of 
protein substrate. How generalizable are the 
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observed effects for other substrates? 

Finally, it must be remembered that the 
unfolding force generated by optical tweezers 
is artificial. It is not clear how representative 
these findings are of the situation in cells. 
More experiments, from single-molecule 
studies to in vivo investigations, will therefore 
be needed to better understand the diverse 
roles of Hsp70s. But in the meantime, Mashaghi 
et al. have provided much food for thought. m 
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Ions surprise in 
Earth’s deep fluids 


Models indicate that there are strong gradients in element concentrations and in 
the pH of fluids at the slab- mantle interface — a major discontinuity deep within 
Earth. This transforms our view of global geochemical transport. SEE LETTER P.420 


DAVID DOLEJS 


he chemical and isotopic composition 

| of Earth’s rocks and minerals carries 

a fingerprint of how and where they 
formed. This information provides a unique 
record of the spatial and time scales of past 


planetary processes and the fluxes of the chem- 
ical elements involved. Interpreting this record 
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requires knowledge of the chemical conditions 
generated when aqueous fluids and minerals 
interact at extreme depths, but such knowl- 
edge has been hard to acquire. On page 420, 
Galvez et al.' report a method for computing 
the composition and chemical speciation of 
aqueous fluids in equilibrium with different 
rock types, and a predictive model for mass 
transport across the top of Earth’s subduction 


zones — the regions at which portions of a 
tectonic plate are being forced underneath 
another plate. 

Mass transport on our planet is driven by 
the motions of plates in Earth’s outer shell 
(the lithosphere), and rocks formed at Earth’s 
surface can be recycled to depths of up to 
200 kilometres. As temperature and pressure 
rise with depth, the rocks undergo complex 
mineral transformations, including the release 
of volatile constituents such as water and car- 
bon dioxide. These fluids are highly mobile in 
rock environments because of their low density 
and high buoyancy (Fig. 1), and act as transport 
agents for diverse chemical elements. 

When the fluids penetrate major disconti- 
nuities in the Earth, such as the boundary 
between subducting slabs of lithosphere 
and the upper mantle, they act asa medium 
for complex dissolution—precipitation reac- 
tions, which allow subducted minerals to 
attain chemical equilibrium in their new rock 
environment. These reactions strongly alter 
element and isotope concentrations in the fluid 
and surrounding rocks, and thus chemically 
differentiate Earth’s materials, dictate element 
mobility and define global element fluxes. 

Understanding the reactivity of aqueous and 
carbon-containing fluids with minerals and 
rocks in Earth's deep interior has been a long- 
standing challenge for laboratory research 
and dynamic simulations. However, experi- 
mental advances~” in the past few years have 
allowed mineral solubilities and the stabilities 
of chemical species at extreme tempera- 
tures and pressures to be measured with 
unprecedented accuracy. In parallel, atomistic 
simulations** have delivered new calibrations 


Oceanic crust 


Suboceanic 
mantle 


Upper mantle 


Figure 1 | Fluid movement at subduction zones. Subducting slabs of 
oceanic lithospheric plates (which consist of oceanic crust and suboceanic 
mantle) are exposed to increasingly high temperatures and pressures. 
Minerals in the oceanic crust that bear volatile constituents, such as water or 
carbonates, break down, and mobile fluids are released. The buoyant fluids 
rise from the slab and encounter the slab-mantle boundary — the chemical 
interface between rocks of the oceanic crust and the upper mantle. They 


of the electrostatic properties of water at such 
temperatures and pressures. These emerging 
data are essential for constructing sets of 
thermodynamic data for aqueous chemical 
species, which are used in simulations to predict 
fluid-mineral reactions and element transfer 
in natural systems. Such predictive simulations 
typically involve more than six chemical com- 
ponents, multiple mineral phases and tens of 
aqueous species in the fluid®”. 

Nevertheless, obtaining robust information 
about the simultaneous exchange of multiple 
elements between minerals and fluids at Earth's 
extreme depths has not been possible — until 
now. Galvez and colleagues’ computational 

method is based on 


The authors the idea that the che- 
realized that mical potentials of 
equilibria major COmPOneRES 
betweenmineral °! she a. 

assemblages (such as 
coe onents and silica, aaa and 
lonic species can sodium oxide) are 
be expressed the same regardless 
using simple of whether they exist 
hydrolysis as aqueous species 
reactions. or as solids in rocks. 


Chemical potentials 
can be determined for all components within 
stable mineral assemblages, and dictate the 
chemical activity of each species and overall 
element concentrations in coexisting fluids 
at any pressure and temperature. The authors 
realized that equilibria between mineral com- 
ponents (oxides) and ionic species can be 
expressed using simple, independent hydroly- 
sis reactions, and used this as the basis of their 
computational approach. 


— Magma chamber 


—— Magma source 
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Galvez and co-workers‘ simulations reveal 
that, during subduction of Earth’s oceanic 
crust, minerals containing hydroxyl or car- 
bonate groups break down as the pressure 
and temperature increase, forming volatile- 
-free solid phases and a free fluid. The fluid 
progressively dissolves the mineral phases, and 
thus acquires diverse cations, which make it 
moderately alkaline. Minerals in peridotite 
rock in the upper mantle surrounding the sub- 
ducting slab also initially produce fluid that is 
alkaline, underpinned by the release of calcium 
cations from the mineral assemblage to the 
fluid. But when the temperature exceeds about 
500°C, the calcium cations become compatible 
with high-temperature minerals and therefore 
partition into the minerals from the fluid. The 
acidity of the aqueous fluid in the mantle thus 
increases, reaching weakly alkaline or near- 
neutral conditions. This phenomenon does 
not occur in the rocks of the subducting slab, 
and soa sharp acidity gradient is generated at 
the slab-mantle boundary. 

But how does the difference in acidity affect 
fluid reactivity and chemical modification 
across this major geological interface? Galvez 
and colleagues’ simulations show that infiltra- 
tion of slab-derived fluid into the overlying 
mantle causes highly nonlinear effects: a 
rapid increase in fluid pH, followed by a sharp 
decrease. The infiltrated rocks respond in a 
highly nonlinear way, and may reach the limit 
of their ability to buffer or moderate the pH 
changes after coming into contact with a cer- 
tain amount of fluid. This finding transforms 
our view of when the chemical composition of 
Earth’s major geological domains is modified 
and where these processes are localized. 


Continental crust 


Solid upper mantle 


| Aqueous fluids 


—— Slab-mantle boundary 


slab and mantle. 


Partially molten 
upper mantle 


then rise and cool until they become incorporated into magma. The cycling of 
global elements depends on the solubility of chemical components in aqueous 
fluid in the slab, the reactivity of the multi-component fluid at the slab-mantle 
interface, and the incorporation into magmas of fluids with their load of 
dissolved material. Galvez et al.' report thermodynamic simulations that 
provide insight into the geochemical processes occurring in the deep 
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The authors extended their predictive 
approach to fluids of even greater complexity 
that contain appreciable concentrations of 
ligand species, such as carbonates or chlorine. 
This allowed the chemical cycles of volatiles 
(including CO,) and feedback relationships 
between ligand behaviour and metal mobility 
to be addressed. 

The principal sources of carbonates at 
subduction zones are oceanic sediments and 
variably altered oceanic crust, whereas chlo- 
rine is derived from ocean water in the rock’s 
pore and fracture space. At temperatures of 
between 200 and 700°C, these ligands gradu- 
ally form negatively charged species in fluids, 
and so concentrations of cations — including 
hydrogen ions, H*, which determine pH — in 
the fluid rise to balance the negative charge. 
Aqueous fluids associated with any slab rock 
type will therefore probably be weakly alkaline 
to almost neutral. Many dissolution—precipi- 
tation reactions are acid-base exchange pro- 
cesses (they involve the exchange of metal ions 
by H’), and so this relative increase in fluid 
acidity will probably promote the solubility 
and mobility of metals. 

Earth’s redox state changes between the 
upper continental crust and mantle by more 
than eight orders of magnitude*”, and exerts 
additional controls on the distribution of spe- 
cies in aqueous fluids. For example, iron(111) 
species are much less stable and are less solu- 
ble in water than their iron(i1) counterparts, 
and CO, is more acidic than methane (CH,). 
The effects of these characteristics act in con- 
junction to lower the net charge in the deep 
fluid and promote an increase in acidity. This 
perturbation propagates through different 
chemical species, resulting in new shifts of 
chemical equilibria and feedback relationships. 
Incorporation of redox equilibria into Galvez 
and colleagues’ model will therefore probably 
allow previously unknown chemical gradients 
or interfaces to be identified in Earth’s dynamic 
subduction systems. = 
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Sleepy and dreamless 
mutant mice 


Sleep in mammals consists of non-rapid-eye-movement and rapid-eye- 
movement sleep. A large genetic screen reveals that these two sleep states are 
altered in mice by mutations dubbed Sleepy and Dreamless. SEE ARTICLE P.378 


DERK-JAN DIJK 
& RAPHAELLE WINSKY-SOMMERER 


ome people enjoy getting as much sleep 

as possible, whereas others claim to need 

only a few hours each night. Some peo- 
ple remember their dreams well, others don't. 
What is the genetic basis for these individual 
differences? In ‘forward’ genetic screens, genes 
are mutated at random and the effects of each 
mutation on a behaviour or biological trait 
of interest are assessed. On page 378, Funato 
et al.' report a large forward-genetic screen 
that identifies two genetic mutations that 
mediate variations in sleep patterns in mice. 

Forward genetics is considered a gold- 
standard technique for identifying and 
characterizing previously unknown roles 
for genes. But one key prerequisite is that the 
trait of interest can be accurately assessed 
(phenotyped). More than 20 years ago’, 
the phenotyping of circadian rhythms in 
mice through long-term recording of rest- 
activity cycles allowed researchers to use 
forward genetics to identify genes involved in 
these rhythms. But it has proved difficult to 
successfully apply forward genetics to the more 
complex process of mammalian sleep. 

Sleep is much more than simply rest. It 
consists of two states — non-rapid-eye-move- 
ment sleep (NREM sleep; also known as 
deep sleep) and rapid-eye-movement sleep 
(REM sleep; also known as paradoxical sleep 
and sometimes referred to as dream sleep). 
Accurate phenotyping of sleep is challeng- 
ing because it requires the identification and 
quantification of each of these sleep states 
through long-term, large-scale recordings of 
electrophysiological signals that measure brain 
waves and muscle activity. 

Funato and colleagues rose to this challenge, 
recording patterns of sleep for two consecutive 
days in more than 8,000 mice carrying genetic 
mutations that were randomly induced in 
one of their parents. One group of mice slept, 
on average, for about 15 hours a day — more 
than 3.5 hours longer than the average of all 
screened mice, and much longer than previ- 
ously documented for laboratory mice’. The 
authors, aptly, dubbed these mutant mice 
Sleepy (Fig. 1). 

The researchers showed that NREM sleep is 
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specifically altered in Sleepy mice. The mutants 
had deeper baseline NREM sleep and reacted. 
to sleep loss with more-intense NREM sleep 
than control mice. The mice responded to 
wake-promoting compounds such as caf- 
feine, and Funato et al. found no evidence of 
behavioural abnormalities, ruling out defects 
in wake-promoting neural circuits as the cause 
of increased sleep. There were also no obvi- 
ous abnormalities in the period of the animals’ 
circadian rhythms. This finding is interesting; 
alterations in genes that control circadian 
rhythms have previously been shown’ to affect 
sleep, but the current experiment demon- 
strates that sleep- and circadian-based effects 
can be separated. 

Funato and colleagues found that the 
mutation in Sleepy mice affects the Sik3 gene, 
which has not previously been implicated in 
sleep. A single-nucleotide mutation prevents 
the inclusion of the 13th protein-coding region 
(exon) of Sik3 in the SIK3 protein, presumably 
altering its activity. SIK3 is a protein-kinase 
enzyme that is expressed throughout the 
body and brain, and not just in known sleep- 
regulating areas — drawing a parallel with the 
clock genes that control circadian rhythms, 
which are expressed not only in the circadian 
master regulator in the brain, but also through- 
out the body. A strength of the paper is that 
the authors confirmed that Sleepy is indeed 
caused by skipping exon 13 of Sik3. By either 
introducing the same point mutation or using 
a refined genome-editing technology in mice, 
they recapitulated the Sleepy phenotype. Fur- 
thermore, they demonstrate that mutations in 
Sik3 also affect sleep-like behaviours in fruit 
flies and roundworms, suggesting an evolu- 
tionarily conserved function for this gene. 

From their screen, Funato et al. identified 
another group of mice, dubbed Dreamless, 
in which REM sleep time was reduced by an 
impressive 44% and occurred in shorter bouts 
than normal. It might be an overstatement to 
refer to these mice as Dreamless — some REM 
sleep remained, and humans also dream dur- 
ing NREM sleep’ (and some might doubt that 
mice experience dreams in the first place). 
Nonetheless, Dreamless mice have very little 
REM sleep compared with control mice’. The 
researchers demonstrated that the mutation in 
Dreamless mice affects the Nalcn gene, which 


Induce 


; mutation 
Xx 


Figure 1 | Screening sleep. Funato et al.' performed a ‘forward’ genetic 
screen to search for mutations in genes that modulate sleep behaviours in 
mice. In forward genetics, subtle mutations (red) are randomly introduced 
into the DNA of mice, the mutant mice are crossed with wild-type partners, 
and the offspring produced are analysed for changes in a behaviour of interest. 
In this case, the authors analysed sleep in more than 8,000 offspring by taking 


codes for an ion-channel protein and has pre- 
viously been implicated in circadian control of 
neuronal excitability in fruit flies®. 

In contrast to Sleepy, Dreamless mice have an 
altered circadian phenotype — the difference 
in activity levels between periods of rest and 
activity is reduced when these mice are sub- 
jected to constant darkness. This is of interest 
because REM sleep, but not NREM sleep, is 
under strong circadian control’. Finally, elec- 
trophysiologically assessed brain waves were 
altered in both mutants compared with con- 
trols, during REM and NREM sleep and dur- 
ing wakefulness. This finding, which implies 
that the mutations affect all vigilance states, 
could have been obtained only by the compre- 
hensive phenotyping conducted. 

What are the implications of this study? 
Sleep has multiple functions for the brain and 
the body, including regulation of learning, 
memory and metabolism®. Funato and col- 
leagues’ identification of these new mutants 
opens up avenues for identifying the mecha- 
nisms by which sleep exerts these functions. 
For instance, whether the Sleepy and Dreamless 
mutations cause alterations in these traits can 
be addressed immediately. 

The study might also have clinical implica- 
tions. One obvious question is whether muta- 
tions in SIK3 in people are implicated in a 
condition called idiopathic hypersomnia, in 
which people experience excessive sleepiness 
and sleep for long periods’. Another is whether 
mutations in NALCN have a role in increased 
age-related changes in cognitive decline, which 
in humans are predicted by reduced REM 
sleep’. 

It is notable that Funato et al. identified no 
mice that did not sleep. This is in contrast to 
circadian rhythmicity, which can be abolished 
bya single mutation. Perhaps too many genes 
are involved in sleep for a forward-genetic 
screen to produce a sleepless mammal. Or 


Mutant offspring 


Sleep 
electrophysiology 
screen 


maybe sleepless mice are not viable, meaning 
that these mutations cannot be identified. 
Overall, Funato and colleagues’ impressive 
study pays tribute to the importance and com- 
plexity of sleep by combining comprehensive 
phenotyping with state-of-the-art genetic and 
molecular tools. This strategy has unveiled new 
mutations that change the time spent in the two 
sleep stages that all mammals experience. = 
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Mutations 
identified 


> Sik3 


Sleep mutants identified 


Dreamless 


electrophysiological measurements of brain waves and muscle activity, and 
identified two groups of mice, dubbed Sleepy and Dreamless, that showed 
increased non-rapid-eye-movement (NREM) sleep and decreased rapid- 
eye-movement (REM) sleep, respectively. Subsequent genome sequencing 
revealed that the mutation in Sleepy mice affects a gene called Sik3; the 
mutation in Dreamless mice affects the gene Nalcn. 
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Semiconductors that 
stretch and heal 


Polymeric semiconductors have been prepared whose molecular properties make 
them stretchable and healable — a milestone in the development of sophisticated 
organic electronic surfaces that mimic human skin. SEE LETTER P.411 


SIEGFRIED BAUER & 
MARTIN KALTENBRUNNER 


soft substances that have the ability 

to stretch, and to heal when damaged. 
By contrast, everyday electronic appliances 
are made mostly of hard, brittle materials 
incapable of self-repair. Scientists desire to 
narrow this gap between the animate and 
digital worlds; this will require new approaches 


LL organisms are made largely from 


in materials science. On page 411, Oh et al.' 
report their use of non-covalent-bonding 
mechanisms to make high-performance 
organic semiconductors that are intrinsically 
stretchable and healable. This allowed the 
authors to fabricate transistors — elementary 
electronic components used in amplifiers 
and logic circuits — entirely from stretchable 
materials in thin films, potentially opening the 
way to new generations of wearable electronic 
devices. 
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Figure 1 | Molecular design for an intrinsically stretchable and healable 
organic semiconductor. Oh et al.’ have prepared a polymer that contains 
semiconducting crystalline domains (blue) and flexible, amorphous polymer 
chains crosslinked by hydrogen bonds. When the material is subjected to 
strain, the chains reversibly elongate and the hydrogen-bond network breaks 


In his 1940 book The Nature of the Chemical 
Bond and the Structure of Molecules and Crys- 
tals, the chemist Linus Pauling beautifully 
described the non-covalent origin of hydro- 
gen bonds’: “It has been recognized in recent 
years that under certain conditions an atom of 
hydrogen is attracted by rather strong forces to 
two atoms, instead of only one, so that it may 
be considered to be acting as a bond between 
them.” Being weaker than covalent bonds, 
hydrogen bonds easily rearrange themselves 
within networks of macromolecules, a fact that 
has been exploited as a self-healing mechanism 
in polymers’. 

By contrast, hydrogen bonds were thought 
to be detrimental in organic semiconductors. 
Research into organic electronics in the 1990s 
focused mostly on m-conjugated molecules, in 
which electrons are highly delocalized — an 
important characteristic for efficient charge 
transport in a semiconductor. Intermolecu- 
lar hydrogen bonds were expected to inter- 
rupt electron delocalization. But in the past 
few years, it has emerged that the possible 
structural basis for building organic semi- 
conductors is much larger than was previously 
anticipated’, and, particularly surprisingly, that 
charges can move effectively in materials that 
contain intermolecular hydrogen bonds’. 

Oh et al. have taken that surprising finding 
to a new level, by incorporating hydrogen 
bonds into polymers not only to make the 
materials tolerant to stretching, but also to 
allow impressive charge transport. They 
achieved this by tailoring the molecular 
structure of semiconducting polymers to 
incorporate crystalline parts, which are 
responsible for charge transport, and amor- 
phous regions crosslinked through hydrogen 
bonding, which take up mechanical strain 
without greatly impairing the material’s 
electronic properties (Fig. 1). The authors 


Tightly tangled 
polymer 


report that films of these polymers crack in 
response to severe elongation, but that the 
cracks can be healed by treating the materials 
with solvent vapour and heat. The healed poly- 
mers almost completely recover their initial 
electronic behaviour. 

So far so good, but the next step is to use 
the materials to make a stretchable elec- 
tronic device. Three general strategies have 
been developed to achieve this goal. The first 
involves placing rigid semiconductors on small 
flexible ‘islands’ embedded in or placed on 
elastomers (rubbery materials) that are con- 
nected by stretchable wires®. This approach 
also allows high-performance, off-the-shelf 

microelectronic 


The use of devices to be inte- 
materials grated onto stretch- 
that can heal able materials. 

themselves adds The second strat- 
anew feature egy is to laminate 
to intrinsically thin foils containing 
stretchable flexible inorganic’ 
electronic or organic® elec- 
devices. tronic components 


onto pre-stretched 
elastomers. When the elastomer is relaxed, 
the laminated system wrinkles, thus allow- 
ing the device to be subsequently re-stretched 
in the direction that the elastomer was 
first extended. 

The third approach takes advantage of 
stretchability at the molecular level, either 
by using percolating networks of nanostruc- 
tured conductors or semiconductors — typi- 
cally tubes or wires — in elastomers’, or by 
incorporating softer materials into a stiff 
semiconducting polymer without disrupt- 
ing charge-transport pathways’’. Oh and 
colleagues’ strategy falls into this latter sub- 
category, but the authors’ use of materials 
that can heal themselves adds a previously 
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Hydrogen 
bonds 


and re-forms. This allows the material to stretch and unstretch without greatly 
compromising the semiconducting behaviour. Because broken hydrogen 
bonds easily re-form, cracks in the bulk material can be healed when heated 
and treated with solvent vapour. The material was used to make a wearable 
transistor that tolerated stretching when mounted on human limbs. 


unavailable feature to intrinsically stretchable 
electronic devices. 

Oh et al. prepared stretchable transistors 
from their polymers as proof-of-concept 
devices, and observed that the devices main- 
tained their electrical performance even after 
500 stretching cycles at strains typical of most 
practical applications. Remarkably, the authors 
found that the transistors also maintained 
high charge-carrier mobility when mounted 
on human limbs and subjected to various 
common movements, such as twisting of the 
hand, folding of the arm and stretching of the 
elbow — as needed for wearable electronic 
devices. 

It should be noted, however, that the 
authors’ transistors are not as durable or 
stretchable as state-of-the-art devices based 
on islands and wrinkles, leaving plenty of 
room for future development. Moreover, the 
voltage required to operate the new transistors 
is large (of the order of tens of volts), which 
lowers the energy efficiency of the devices and 
is thus a concern for autonomously operating 
wearables. Smaller voltages will be essential 
for electronic devices that contact the human 
body. The voltage problem might be alleviated 
by greatly reducing the transistors’ dimen- 
sions — most notably, the thickness of the 
electrically insulating layer between the con- 
trol electrode and the transistor’s semiconduc- 
tor — although this would be challenging to 
achieve. It should also be noted that heating for 
prolonged periods with solvent vapour may 
be an impractical means of healing devices 
worn on human skin. 

Perhaps the ultimate goal for scientists 
developing flexible electronics is to produce 
something that behaves like human skin: the 
stretchable organ that covers and protects our 
bodies, enables us to feel touch, pain and tem- 
perature, and triggers a healing process when 


wounded. A perfect technological imitation 
may not be possible, but Oh and colleagues’ 
work is a milestone in the search for elec- 
tronic skins that behave much like their 
archetype. In the shorter term, healable soft 
electronic devices hold promise for truly 
bionic and smart electrical appliances, and 
might revolutionize future generations of 
wearables. m 
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Deep-sea secrets of 
butane metabolism 


Anaerobic microbes have been found to break down the hydrocarbon butane by 
a pathway with some similarities to anaerobic methane breakdown. Harnessing 
the butane pathway might enable biofuel generation. SEE ARTICLE P.396 


STEPHEN W. RAGSDALE 


icrobes living in anaerobic conditions, 
M such as in deep-sea sediments, 
need a way of generating energy that 
does not require oxygen gas. Some anaerobic 
microbes do this by breaking down methane 
to form carbon dioxide’. Others’ obtain energy 
by metabolizing the hydrocarbon butane 
(C,H,)), but the butane-breakdown pathway 
has until now been unknown. Determining 
how butane is metabolized might be useful 
for making biofuels or for developing effi- 
cient catalysts for hydrocarbon activation (the 
process of breaking a hydrocarbon’s chemical 
bonds). On page 396, Laso-Pérez et al.’ report 
the pathway for microbial anaerobic break- 
down of butane to CO,. The authors identify 
some butane-breakdown steps ripe for further 
investigation, and observe that some similar 
steps have evolved in the microbial processes 
for degrading methane and butane. 
Laso-Pérez and colleagues obtained deep-sea 
sediment samples from the Gulf of California 
and isolated fractions that could degrade 
butane. The butane degradation they observed 
depended on a microbial consortium formed 
of Candidatus Syntrophoarchaeum archaea 
and HotSeep-1 bacteria, two types of prokary- 
ote (organisms lacking a nucleus). The authors 
found that the microbial consortium can also 
metabolize propane (C,H,), although it is 
not known whether propane and butane are 


degraded by the same enzymes or by different 
enzymes that have specific substrate prefer- 
ences. However, the consortium does not 
degrade methane, nor several other short-chain 
hydrocarbons the authors tested. 

Butane consists of carbon and hydrogen 
atoms linked by strong single bonds, and 
the molecule lacks a reactive group. The 
stability of the carbon-hydrogen (C-H) bond 
poses a formidable and interesting chemical 
challenge in butane breakdown. For anaero- 
bic microbial breakdown of methane, the 
thermodynamically unfavourable hydro- 
carbon breakdown proceeds by coupling it to 
a thermodynamically favourable reaction: for 
example, the transfer of electrons to sulfate to 
form hydrogen sulfide (H,S, a reduced form of 
sulfate)**, Laso-Pérez and colleagues investi- 
gated whether such thermodynamic coupling 
might occur in their microbial consortium, 
and found evidence for sulfate reduction that 
depended on butane breakdown. In anaerobic 
methane degradation, sulfate reduction occurs 
by direct interspecies electron transfer** from 
the archaea to their bacterial partner Hot- 
Seep-1. Laso-Pérez and colleagues find no 
evidence for sulfate-reduction genes in the 
archaea they studied, and propose that an 
interspecies electron-transfer mechanism also 
occurs in the butane-degradation pathway. 

The authors determined experimentally 
that the overall butane-breakdown reaction 
consists of the conversion of butane, sulfate 
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50 Years Ago 


Several oncogenic viruses have 
been known and studied for some 
years and it is therefore important 
to search for viruses as aetiological 
agents in human cancer ... Aman 
36 years old and of previous good 
health was referred to one of us... 
with clinical evidence of hepatic 
malignancy. The presence of primary 
hepatoma had been confirmed at 
laparotomy. A liver biopsy specimen 
was obtained by means ofa needle ... 
Histological examination showed 
the presence of poorly differentiated 
hepatocarcinoma ... In addition, 
particles were found in the cytoplasm 
of some 90 per cent of the eighty 
tumour cells examined ... Their 
size and shape indicate that these 
particles are viral in nature. It has 
recently been suggested, however, 
that particles seen as viruses under 
the electron microscope in various 
human leukaemia states may have 
been mycoplasma, small free living 
micro-organisms with many virus- 
like properties. The true nature of 
the particles we have seen must 
therefore await identification. 

From Nature 19 November 1966 


100 Years Ago 


Judging from the correspondence 
which has recently appeared in 
NAaTurgE, the dearth of wasps this 
autumn in many parts of England 
has been most pronounced ... 

I was staying, with my wife and 

son, in a cottage on Christon 

Hill, on the Mendip Hills ... we 
were simply besieged by wasps, 
which were particularly tiresome 

at the breakfast-table. They were 

so numerous at times ... that we 
frequently had to abandon our meals 
temporarily to punish the offenders, 
my son continuing his captures 
often for halfan hour at a time... I 
cannot recollect ever having seen so 
many wasps in a house, unless it was 
during the hottest part of 1911. 
From Nature 16 November 1916 
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Candidatus Syntrophoarchaeum 


Hydrocarbon 
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CoA ——> 2H,C 
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Four steps Reverse Reverse 
of fatty-acid Wood-Ljungdahl methanogenesis by 
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CoA ———> 2CO, + 2H,C —H,MPT —————> 200, 
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| 
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Thermodynamically 


Figure 1 | Proposed pathway for butane breakdown. Laso-Pérez et al.° 
propose a pathway for the anaerobic breakdown of butane in a microbial 
consortium formed between Candidatus Syntrophoarchaeum archaeal 
microbes and HotSeep-1 bacteria. Although the overall reaction in 
Candidatus Syntrophoarchaeum is thermodynamically unfavourable, the 
reaction proceeds by coupling to a thermodynamically favourable reaction in 
HotSeep-1 bacteria. The initial butane cleavage, a step known as hydrocarbon 
activation, creates a butane derivative bound to coenzyme M (butyl-CoM). 
The authors propose that butane activation is catalysed by an enzyme similar 
to methyl coenzyme M reductase (MCR). Through unknown steps, the 
intermediate butyl-CoM is converted to butyryl-CoA, which then undergoes 


favourable 


four steps of fatty-acid oxidation to form acetyl-CoA. Wood-Ljungdahl 
pathway’ enzymes convert acetyl-CoA to carbon dioxide and a methyl 
group bound to the coenzyme tetrahydromethanopterin (H,MPT). This 
methyltetrahydromethanopterin derivative is then converted by Wolfe-cycle® 
enzymes to CO, ina process related to the methane degradation pathway 
known as reverse methanogenesis'. In the microbial butane-degradation 
pathway, the Wood-Ljungdahl pathway enzymes and Wolfe-cycle enzymes 
act to degrade, rather than synthesize, their normal products. These enzyme 
reversals occur when these reactions are coupled to the thermodynamically 
favourable reduction of sulfate to hydrogen sulfide, which occurs through 
interspecies electron transfer to HotSeep-1 bacteria. 


and hydrogen ions to final products of CO,, 
HS and water. On the basis of clever identi- 
fication of genes that might have a role in the 
butane-degradation pathway and determina- 
tion of possible intermediate compounds and 
cofactors, Laso-Pérez et al. propose a complete 
degradation pathway from butane to CO, in 
Candidatus Syntrophoarchaeum. The authors 
identify some key stages in the degradation of 
pathway intermediates (Fig. 1). 

The first step in butane degradation involves 
the breakage of one of butane’s C-H single 
bonds. In methane degradation, C-H bond 
activation is catalysed by the enzyme methyl 
coenzyme M reductase (MCR), resulting in 
the formation of a methyl group bound to 
coenzyme M (CoM)*. Laso-Pérez et al. tested 
whether CoM might also be associated with 
butane breakdown. Using mass spectrometry, 
they identified a butane derivative bound to 
CoM (butyl-CoM) in their microbial sample. 
The authors also observed that a CoM-mimic 
molecule, bromoethanesulfonate, inhibits 
butane degradation. These results strongly 
implicate the catalytic action of an enzyme 
similar to MCR in the initial C-H bond 
cleavage of butane. 

The next stage of the breakdown process, 
the multi-step conversion of butyl-CoM to 
a derivative bound to coenzyme A (CoA), 
known as butyryl-CoA, constitutes the most 
notable mechanistic uncertainty in the path- 
way described by Laso-Pérez and colleagues. 
During this sequence, a butane-derived group 
is transferred from CoM to the ubiquitous 
CoA. One option is that a butyltransferase 
enzyme known as Mta catalyses this reaction 
by transferring the butyl group to CoA, which 
generates butyl-CoA. However, it seems more 


probable that butyl-CoM undergoes oxidation 
to generate butyryl-CoM, and that the CoM 
is subsequently displaced by CoA to generate 
butyryl-CoA. 

The authors propose that fatty-acid oxidation 
steps convert butyryl-CoA to the common 
metabolic intermediate acetyl-CoA. Then 
Wood-Ljungdahl pathway’ enzymes oxidize 
acetyl-CoA, resulting in the formation of CO, 
and a methyl group bound to the coenzyme 
tetrahydromethanopterin (H,MPT). The 
final steps of the pathway involve the action 
of Wolfe-cycle enzymes’, which oxidize the 
methyltetrahydromethanopterin, resulting in 
the formation of CO,. 

An enzyme is usually thought of as 
catalysing a directional reaction that trans- 
forms molecule A into molecule B. However, 
an enzyme actually catalyses a reaction in a 
state of equilibrium, and if the equilibrium 
changes, the enzyme can run in reverse and 
transform molecule B into molecule A. Two 
of the key metabolic modules in the pro- 
posed butane-degradation pathway involve 
reactions that are best known for running 
in the opposite direction. In the butane- 
degradation process, Wood-Lungdahl enzymes 
degrade acetyl-CoA, although they usually 
synthesize acetyl-CoA. Similarly, during 
butane degradation, Wolfe-cycle enzymes 
uncharacteristically degrade rather than syn- 
thesize a methyl group. This reverse action of 
Wolfe-cycle enzymes is also observed during 
methane degradation by anaerobic microbes in 
the process known as reverse methanogenesis’. 

A particularly interesting enzyme-reversal 
question arising from this research is whether 
any ‘butanogenic’ microbes might naturally 
exist, or could be engineered, that could run 
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the entire butane-degradation pathway in 
reverse, thereby generating butane biofuel. 
A butane-synthesizing reaction would be 
thermodynamically favourable, and would not 
require coupling to sulfate reduction. 
Laso-Pérez and colleagues’ analysis of the 
anaerobic consortium that degrades butane 
provides a glimpse into a pathway for the 
oxygen-independent breakdown of short- 
chain hydrocarbons. The authors provide 
evidence that butane oxidation is linked to 
sulfate reduction. It will be interesting to inves- 
tigate whether butane oxidation is coupled to 
other types of reduction process, such as the 
reduction of nitrate or metal that occurs in 
reverse methanogenesis’. Further biochemi- 
cal studies of butane metabolism might lead 
to major advances in the design of efficient 
catalysts for hydrocarbon activation, or enable 
the use of CO, to generate butane or other 
intermediate compounds in this pathway. = 
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The effects of A’-tetrahydrocannabinol 
on the dopamine system 


Michael A. P. Bloomfield)?:3+>, Abhishekh H. Ashok!?:, Nora D. Volkow® & Oliver D. Howes!?:4 


The effects of A®-tetrahydrocannabinol (THC), the main psychoactive ingredient in cannabis, are a pressing concern 
for global mental health. Patterns of cannabis use are changing drastically owing to legalization, the availability of 
synthetic analogues (commonly termed spice), cannavaping and an emphasis on the purported therapeutic effects 
of cannabis. Many of the reinforcing effects of THC are mediated by the dopamine system. Owing to the complexity of 
the cannabinoid-dopamine interactions that take place, there is conflicting evidence from human and animal studies 
concerning the effects of THC on the dopamine system. Acute THC administration causes increased dopamine release 
and neuron activity, whereas long-term use is associated with blunting of the dopamine system. Future research must 
examine the long-term and developmental dopaminergic effects of THC. 


annabis is a widely used recreational drug. Over half of young 

Americans have used the drug’, whereas in Europe, cannabis has 

now overtaken heroin as the most widely reported illegal drug 
used amongst people entering specialist addiction services”. This provides 
the background to political debates worldwide concerning changes to 
the legal status of the drug. Although causality has not been conclusively 
demonstrated, heavy cannabis use is associated with increased risk of 
mental disorders® including psychosis‘, addiction”, depression®, suicid- 
ality’, cognitive impairment® and amotivation’. 

THC, the main psychoactive component of cannabis”, elicits its 
acute psychoactive effects through the endocannabinoid type 1 receptor 
(CB,R)!!. THC has been linked to the rewarding aspects of cannabis and 
the induction of symptoms of mental illnesses and cognitive impairment. 
Recently, the THC content of cannabis has been increasing”” and synthetic 
THC analogues (potent cannabinoid receptor agonists; termed ‘spice’) are 
now widely used’. The likelihood of a future increase in consumption 
of cannabinoids through electronic cigarettes (cannavaping) and edible 
products! changes the landscape further!>. Given the widespread use of 
cannabinoids, and the links between THC exposure and adverse outcomes, 
it is imperative that the neurobiological effects of THC are understood. 
Recently, we and others have found that heavy cannabis use is associated 
with reductions in dopaminergic function. Since both the rewarding and 
psychotogenic effects of THC and its analogues are thought to be mediated 
by the dopaminergic system, demonstrating dopaminergic alterations in 
human users in vivo is of clinical relevance for the prevention and treat- 
ment of cannabis-use-associated disorders and psychoses. Therefore, we 
present a review of the animal and human literature on the complex effects 
of acute and longer-term THC administration on dopamine synthesis and 
release and on its receptors, providing an account of the current under- 
standing of the field. We critically analyse the factors that contribute to 
the effects of THC, and to variations between studies, before providing a 
framework for future research including a pharmacological dissection of 
these effects, especially in the developing brain. 


THC receptor binding in the brain 
THC is a CB,R and endocannabinoid type 2 receptor (CB2R) partial 
agonist!’. The psychoactive effects of THC are blocked by the CB, R antagonist 


rimonabant!®"’, indicating that these effects are mediated through the 
activation of G-protein-coupled CB;R receptors, which reduce cAMP 
levels by inhibiting adenylate cyclase!®. THC disrupts finely tuned endo- 
cannabinoid retrograde signalling systems owing to the temporal and 
neuronal specificity of endocannabinoids over THC. Under conditions 
of low CB,R density, THC antagonizes endogenous agonists that possess 
greater receptor affinity than THC’. THC also allosterically modulates 
opioid receptors”’, which may provide additional indirect routes for 
altering dopamine transmission”!. Furthermore, THC has psychoactive 
metabolites with CB,R affinity, further complicating the analysis of recep- 
tor-binding studies”. 


CB, receptors and dopamine 

Early animal studies described the interactions of amphetamine, which 
increases dopamine release, and THC”. These studies reported that 
the behavioural effects of amphetamine were potentiated or antago- 
nized, depending on the dose of THC, leading researchers to propose 
that dopamine was “a prime candidate for...the mode of action of 
A?-tetrahydrocannabinol”™. Indeed, THC produces complex effects on 
the dopamine system, contributing to the drug's recreational and harmful 
effects. However, there are inconsistencies between the preclinical and 
clinical findings, which have proven challenging for researchers in the 
field. It is, therefore, timely to review the evidence and provide a frame- 
work for understanding the inconsistencies between the preclinical and 
clinical findings. 

Dopaminergic neurons are modulated by the endocannabinoid 
system’. CB,Rs and the endocannabinoid ligands anandamide and 
2-arachidonoylglycerol (2-AG) are abundant in dopaminergic path- 
ways, including in the striatum”®, where they act as a retrograde feedback 
system on presynaptic glutamatergic and j-aminobutyric acid 
(GABA) nerve terminals (Fig. 1), modulating dopamine transmission. 
Anandamide”’ and 2-AG”* stimulate dopamine release in the nucleus 
accumbens (NAc) shell. Again, this effect is blocked by the CB, antagonist 
rimonabant, indicating that the dopaminergic effects of endocannabi- 
noids involve CB, receptors. The rewarding properties of THC, which are 
mediated through increased dopamine release and dopaminergic neuron 
firing, are underpinned by biased signal transduction mechanisms from 
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Figure 1 | THC binds to CB1 receptors on glutamatergic and 
GABAergic neurons, disrupting normal endocannabinoid retrograde 
signalling from dopaminergic neurons’*°. Endocannabinoids influence 
synaptic signalling in the midbrain. 2-AG is synthesized by diacylglycerol 
lipase in dopaminergic midbrain neurons and, once released, acts 
retroactively on CB,Rs on nearby glutamatergic and GABAergic terminals. 
CB Rs mediate robust inhibition of GABA inputs onto midbrain 
dopamine cells'*°, termed retrograde suppression of inhibition. CB\Rs 
are also localized on glutamatergic terminals that synapse onto midbrain 
dopamine neurons!*’, where endocannabinoids mediate retrograde 
suppression of excitation. Thus, endocannabinoids fine-tune the activity 
of the mesolimbic dopamine projections through the modulation of both 
excitatory and inhibitory signalling and THC disrupts this system. 


the CB,R!°. There is evidence that acute and chronic THC exposure have 
differing effects on the dopaminergic system. We will, therefore, deal with 
these separately and describe their effects on different neurobiological 
components of the dopaminergic system, including neuron firing, syn- 
thesis, release, reuptake and receptors. 


Acute THC and presynaptic dopamine in animals 
It has long been clear that THC exerts complex effects on the dopamine 
system. Early in vitro studies in rodent cells, using radiolabelled dopa- 
mine in synaptosomes, found that treatment with THC caused increased 
dopamine synthesis” and release** (Fig. 2 and Box 1). However, the 
effects on dopamine uptake yielded conflicting results, with evidence of 
both increases*® and dose-dependent decreases”*. Subsequently, bipha- 
sic and triphasic effects of THC were discovered, whereby low doses of 
THC produced increases in the conversion of tyrosine to dopamine, but 
high doses of THC resulted in decreased dopamine synthesis*". Similarly 
complicated temporal relationships between THC administration and 
changes in dopamine levels were observed’, with repeated dosing 
resulting in behavioural and neurochemical tolerance—highly pertinent 
to the mechanisms of dependence to the drug. The complex dose- 
specific effects of THC in rodents were thought to be due to dose-related 
decreases in precursor uptake*” and dopamine-opioid interactions via 
\-opioid receptors”. 

Subsequent work investigated THC-induced increases in dopamine 
synthesis in vivo. THC increased *H-dopamine synthesis**4, tyrosine 
hydroxylase protein levels*> and mRNA expression*®, the rate-limiting 
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Figure 2 | Summary of the acute effects of THC on dopaminergic 
function. In animal models, acute THC challenge is associated with 
increased dopaminergic cell firing, and increased dopamine synthesis 
and release. 


step in the dopamine synthesis pathway. Similarly, increases in dopa- 
mine metabolism, measured through the dihydroxphenylacetic 
acid/dopamine (DOPAC/DA) ratio, were reported in most*”, but 
not all*®, rodent studies. However, the majority of early studies using 
spectrophotofluorimetry were inconsistent due to technical limitations 
in detecting the rapid changes in extracellular dopamine concentration, 
deviations that are now detectable using contemporary microdialysis 
techniques*®. 

In vivo microdialysis shows that acute THC administration increases 
dopamine efflux in the prefrontal cortex (PFC)*®, striatum*® and NAc”. 
Only one study did not find that THC induced increases in dopamine 
efflux”, an anomaly that may have been related to the route of admin- 
istration as that study used a THC gavage—the other studies used 
intravenous injection, which produces a rapid increase in THC which 
reaches the brain promptly, whereas gavage favours sequestration in 
lipid compartments owing to the very high lipid solubility of THC*®. 
Electrophysiological studies in rats have categorically demonstrated that 
THC dose-dependently increases firing rates in ascending midbrain 
dopaminergic projections via CB,R agonism*™*, Together, these find- 
ings suggest that THC increases the firing rates of dopamine neurons, 
leading to increased dopamine synthesis and release in terminal fields. 


Acute THC and postsynaptic dopamine in animals 
Acute THC administration did not alter dopamine receptor protein lev- 
els in rhesus monkeys’. In the rat limbic forebrain, one study reported 
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* Synaptosome: a homogenized mixture of isolated synaptic terminals 


Glossary of methods used to assess dopaminergic activity after THC 


* High-performance liquid chromatography (HPLC): a technique to separate, identify and quantify a particular chemical from a mixture 
+ In vivo electrophysiology: a method of studying single-neuron responses, using microelectrodes to record from cells, including in the living animal brain 
* Microdialysis: a technique used for the continuous measurement of neurochemicals and metabolites through the insertion of a probe into a specific area 


* Positron emission tomography (PET): an imaging technique that uses a positron-emitting radiotracer that binds to specific proteins including receptors, 
enzymes and transporters. The gamma rays produced by the radionuclide allow quantification of proteins in living tissue non-invasively with very high 


* Single-photon emission computed tomography (SPECT): an imaging technique similar to positron emission tomography, but with lower spatial resolution 


increased dopamine type 1 receptor (DR) availability** whereas other 
studies reported decreases**. In the striatum, the density of dopamine 
type 2 receptors (D2Rs) either decreased” or did not significantly change, 
with decreases in D)R also reported**. That there is no change in recep- 
tor protein levels, together with the tendency towards reduced receptor 
availability, almost certainly reflects THC-induced changes in synaptic 
dopamine levels. 


Acute THC and dopamine in humans 

Studies of brain activity in humans, using functional magnetic resonance 
imaging (fMRI) and positron emission tomography (PET), provide indi- 
rect measures of dopaminergic function through changes in cerebral 
blood flow and glucose metabolism. These serve as surrogate markers of 
brain activity in areas with dopaminergic projections. In humans, acute 
THC administration is associated with increased activity in frontal and 
sub-cortical regions*’. However, as the CB;R has the highest concentra- 
tions in these regions”! these findings may be due to direct effects on the 
endocannabinoid system rather than representing dopamine-mediated 
processes. When studies investigating the effects of acute THC admin- 
istration on resting brain activity have focused on regions with dense 
dopaminergic innervation, such as the striatum, there have been incon- 
sistent findings, with reports of both increased and reduced activity’. 
However, certain cognitive tasks are modulated by dopaminergic sig- 
nalling and may provide a more robust proxy for THC-induced changes 
in dopaminergic transmission. For example, motor response inhibition 
is associated with cortical dopamine release; an {MRI study of healthy 
humans with previous exposure to cannabis found that THC attenuates 
activation in the right inferior frontal cortex and the anterior cingulate 
cortex (ACC) during suppression of motor inhibition. Further indirect 
evidence of blunted dopaminergic processing comes from a study 
of healthy humans with previous exposure to cannabis using a verbal 
working memory task in which it was found that THC attenuated striatal 
activation®*. Additionally, a study of reward function in occasional 
cannabis users found that THC induced widespread attenuation of the 
brain response to feedback in reward trials** but not under reward- 
anticipation conditions”. 

It is also possible to directly measure activity of the dopamine system 
using molecular imaging. These studies have examined the effect of acute 
THC administration on dopamine release in humans with previous expo- 
sure to cannabis in vivo. Using PET, a combined analysis®° of two previous 
studies has shown that THC does indeed cause dopamine release in the 
ventral striatum in the human brain’. Likewise, acute THC challenge 
elicited dopamine release in fronto-temporal cortical brain regions”, 
although this finding requires replication with radiotracers that show 
higher affinity for cortical dopamine receptors. However, in a separate 
study using single-photon emission-computed tomography (SPECT), 
no significant THC-induced dopamine release was observed?’, which 
may be due to the relatively small sample size (n = 9) and the lower 
sensitivity of SPECT than PET to dopamine measures. This is likely to be 
because THC-induced dopamine release in the striatum appears to be of 


a lower magnitude than that caused by other drugs such as amphetamine 
and methylphenidate*’, combined with difficulties in imaging dopamine 
changes that are comparatively small*. 


Repeated THC treatment and presynaptic dopamine 
Studies of repeated THC dosing have yielded complex, region-specific 
effects, such as increased total striatal dopamine levels in rats® but 
reduced hippocampal dopamine levels in mice®!. Reduced dopamine 
metabolism was found in the medial PFC in two rat studies, an effect 
that was not observed in the striatum or the NAc”. Administration 
of THC for 21 days downregulated tyrosine hydroxylase mRNA expres- 
sion in the substantia nigra and ventral tegmental area (VTA) midbrain 
nuclei, but not in the cortex or striatum®’. Several studies have reported 
that multiple THC doses do not significantly change dopamine release 
in the NAc in rats“. However, this may be due to differing dopaminergic 
responses within the NAc; in one rat study, repeated THC doses led to 
increased dopamine release in the NAc core but decreased release in 
the NAc shell®. The picture is further complicated by genetic strain 
effects, with increased dopamine release in the NAc observed in Lewis, 
but not Fischer 344, rat strains. Together, these studies indicate that 
repeated THC dosing produces region-specific effects on dopamine 
function. 


Studies in human cannabis users 

Several molecular imaging studies of dopaminergic function have 
been conducted in human cannabis users. Using PET, the capacity for 
dopamine synthesis was observed to be reduced in cannabis users®’. 
Notably, this reduction was driven by users meeting clinical criteria 
for abuse or dependence and was related to the degree of cannabis use 
(Fig. 3). Similarly, in two separate studies, cannabis users exhibited 
reduced dopamine release to a stimulant challenge, which was related 
to degree of cannabis use®’, and cognitive deficits that included poor 
working memory®. Since no alteration in amphetamine-induced dopa- 
mine release was seen in recently abstinent cannabis users”, this effect 
is likely to be related to active use of the drug. Although chronic use was 
not associated with alterations in stress-induced dopamine release’!, 
there was evidence of a positive relationship between the duration of 
cannabis use and the level of stress-induced dopamine release in the 
limbic striatum7!. Likewise, recent data show that cannabis users have 
an attenuated metabolic response to methylphenidate challenge in 
the striatum, with a negative relationship between methylphenidate- 
induced metabolic increases and severity of cannabis use”. There is also 
evidence of reduced dopamine transporter (DAT) density in chronic 
cannabis users’*. Although the interpretation of some of these studies 
is complicated by the fact that cannabis users also frequently smoke 
tobacco, a recent experiment has addressed this by studying cannabis 
users without comorbid substance dependence, showing that cannabis 
users do indeed exhibit reduced dopamine release®. Overall there is 
converging evidence that presynaptic dopaminergic function is reduced 
in cannabis users. 
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Figure 3 | Cannabis use in humans is associated with reduced dopamine 
in the striatum. PET studies have shown lower striatal dopamine 
synthesis and release capacity in cannabis users. Lower dopamine 
synthesis capacity in the dorsal striatum is directly associated with 
reduced motivation levels!®’, whereas reduced dopamine release in the 
ventral striatum is directly associated with negative emotion levels and 
severity of addiction®™. 


Repeated THC administration and postsynaptic 
dopamine 

Studies in rats have reported that multiple THC dosing results in 
increased D)R availability in the midbrain, striatum and PF C74 and 
that this is associated with dopamine receptor sensitization. There is 
further evidence of downstream dopaminergic effects of THC; one 
study in rats’° reported upregulated postsynaptic dopamine recep- 
tor signalling in the NAc through an increase in adenylyl cyclase 
activity, which was proposed to underlie THC-induced changes in 
amphetamine-induced locomotive behaviour. These findings suggest 
that repeated THC dosing results in altered dopamine receptor signal 
transduction. 

Dose-dependent increases in burst-firing in the VTA in response to 
multiple THC administrations have been reported in nearly all murine 
electrophysiological studies*®, with only one exception of which we are 
aware’®, In the substantia nigra pars compacta (SNpc), multiple THC dos- 
ing was associated with increased neuronal firing, although this increase 
was smaller in magnitude than in the VTA”””*. This suggests that VTA 
and SNpc dopamine neurons develop differing responses to repeated 
THC exposures. There also appears to be an effect of withdrawal from 
multiple THC doses, whereby decreased firing is elicited by the abrupt 
cessation of repeated THC or administration of the CBR antagonist 
SR141716A (ref. 79). 

Recent studies in current cannabis users and abstinent former users 
have found no significant difference in striatal DR availability compared 
to individuals with no history of chronic cannabis use®8-7.80-82 However, 
there was an inverse relationship between the age of first cannabis use and 
DR availability in one study” and an inverse relationship between cur- 
rent cannabis use and receptor availability in another study*, suggestive 
of possible dose effects or susceptibility to drug use. 


THC exposure and dopamine cell morphology 

In addition to changes in cell function, there is evidence from studies 
in rodents that THC exposure causes abnormalities in the structure of 
dopamine neurons. These morphological effects are region-specific and 
include reductions across a range of measures of neuronal cell size in 
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the VTA®*4 and increased neuronal arborization in the NAc shell and 
frontal cortex®. 


Developmental THC exposure and dopamine 

Adolescence is an important time for brain development and adolescent 
cannabinoid exposure has consequences in adulthood®®. The endocan- 
nabinoid system plays an important role in brain development*’; devel- 
opmental THC exposure produces complex alterations in the dopamine 
system that are apparent from an early stage. For example, gestational 
THC exposure is associated with increased fetal brain tyrosine hydrox- 
ylase mRNA expression in rats** and changes in dopamine receptor 
gene expression. THC exposure in early development is associated with 
increased cortical D2R availability®’. In rats, exposure to THC from ges- 
tational day 5 to post-natal day 21 resulted in decreased levels of DRD2 
mRNA (encoding D»R) in the NAc. Likewise, human maternal cannabis 
use was associated with decreased DRD2 mRNA in the fetal NAc”. 
Exposure to THC in early life also blunts the dopaminergic response to 
dopamine-releasing stimuli, such as stress and amphetamine, later in 
life. This was seen in the hypothalamus, striatum and limbic forebrain”!, 
and frontal cortex®”. In a study of CB,R agonist self-administration in 
rats”’, THC exposure during adolescence enhanced the reinforcing 
effects of cannabinoids in adulthood, suggesting that exposure to THC 
during adolescence increased addiction potential during a critical 
period of development. Importantly, THC-exposed rodents had a 
reduced capacity for cannabinoid-induced increases in firing of dopa- 
minergic neurons, consistent with a blunting of the dopamine response 
in adulthood. 


Inconsistencies between animal and human studies 
Preclinical evidence shows that acute THC administration increases 
nerve-firing rates and the results of animal studies using microdialysis 
indicate that acute THC challenge causes dopamine release, yet the results 
of experiments from human studies have not been consistent”. A num- 
ber of factors may underlie the inconsistencies in the results of the studies 
presented above. 

The animal evidence indicates that differences in dopamine activity 
following THC administration are highly region-specific and include 
differing dopaminergic responses in the shell and core of the NAc®. 
Although human PET imaging provides a reliable measure of dopaminer- 
gic function in the striatum®’, the spatial resolution of most PET cameras 
is approximately 4 mm, which limits accuracy when measuring activity 
in small brain volumes (such as in the core and shell of the NAc) owing to 
partial-volume and spill-over effects”®. For the foreseeable future, there- 
fore, human in vivo imaging techniques lack the spatial resolution to 
detect these complex effects on different parts of the dopamine pathway, 
whereas research on living human brain tissue in vitro of course raises 
substantial ethical challenges”. 

A further possibility underlying the lack of consistent dopamine release 
in the human studies is that the dose of THC used in human imaging 
studies has not been sufficient to consistently elicit measurable dopamine 
release. Human studies all involved the administration of 10 mg or less 
of THC because older pharmacological studies indicated that the ‘stand- 
ard joint’ (or cannabis cigarette; defined by the US National Institute on 
Drug Abuse) delivered an approximate THC dose of 8-15 mg, equivalent 
to about 170g kg? (ref. 98). These doses are considerably less than 
those used in animal microdialysis studies, which are typically around 
1 mg kg” !. Correspondingly, though, the THC content of cannabis has 
increased considerably since early clinical studies”’, with current THC 
doses possibly exceeding 40 mg per joint!°, meaning that the doses that 
were used in the published imaging studies no longer reflect typical THC 
exposure in cannabis users. The picture is further complicated by animal 
data indicating that THC exerts complex dose-dependent effects on the 
dopamine system*!; the dose-response profile for THC-induced dopa- 
mine release has not been investigated in humans so it is possible that 
human imaging studies are missing peak dopamine changes. Likewise, 
heterogeneity in animal data may be a reflection of time of sampling in 
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relation to THC treatment. For example, repeated daily THC administra- 
tion led to an initial decrease in dopamine levels, followed by a gradual 
return to baseline in brains that were studied 1h after THC dosing, 
but increases were subsequently observed 2h after dosing’. This time 
period is consistent with acute release and diffusion away, followed by 
an upregulation of synthesis, increasing levels at 2h. Other factors that 
may contribute to the complex temporal course of dopaminergic effects 
include intricate changes in dopamine synthesis and metabolism. These 
comprise both CB|R-mediated increases in tyrosine hydroxylase activity** 
and acute increases and longer-term decreases in monoamine oxidase 
activity'”!, indicating that THC may exert differential effects on the time 
courses of dopamine synthesis and degradation. Equally, the partial ago- 
nist properties of THC on the CBjR serve only to obfuscate its dopa- 
minergic effects, since THC can both activate and block cannabinoid 
receptors in a region-specific and species-dependent manner, based on 
the density and efficiency of populations of receptors and the concentra- 
tions of endogenous agonists’”. 

There are a number of challenges involved in assessing the long-term 
effects of exposure to THC. First, it is plausible that discrepant findings 
may be attributable to the duration of THC treatment. Animal experi- 
ments are typically conducted following administration of THC for no 
longer than three weeks®:”8*4, whereas human studies are conducted in 
participants who have taken cannabis over years (the duration of reg- 
ular cannabis use is typically in excess of three years”). Furthermore, 
it remains unclear to what degree homeostatic mechanisms are able to 
compensate for these alterations over time. Additionally, in humans there 
are a number of psychosocial and genetic risk factors that may predis- 
pose an individual to develop a cannabis use disorder. Given the evi- 
dence of interactions between early life psychological deprivation and 
THC-induced effects on dopamine in animals”, it is possible that the 
human data relating to cannabis-dependent participants is confounded 
by a range of environmental factors, including parental loss and other 
psychosocial factors'”’. Other environmental and physiological fac- 
tors have been found to influence the dopaminergic effects of THC. Of 
particular relevance to humans is sleep deprivation, which decreases 
dopamine turnover in response to THC! and stress, which increases 
THC-induced dopamine synthesis'®°. Cannabis also contains a multitude 
of other compounds, called phytocannabinoids, which probably exert 
different complex actions on the dopamine system. These compounds 
could be behind the varied response observed in human cannabis users 
when compared to the more consistent animal data that uses only THC. 
Owing to the co-morbidity of addictions, it is particularly challenging for 
human imaging researchers to recruit participants with cannabis depend- 
ence who do not use other psychoactive compounds, including nicotine 
and alcohol, in more naturalistic studies. 

Finally, whether THC is delivered in a contingent or non-contingent 
way is likely to be important, given evidence that anticipation leads to 
dopamine release. In general, animal studies include a variety of contin- 
gencies around the dosing of THC. By contrast, in human acute studies 
THC was given to participants with a prior exposure to the drug but was 
not delivered in the habitual manner in which the drug is consumed, 
potentially reducing the degree of dopamine release. 


Dopamine and the behavioural effects of THC 
THC is associated with a number of behavioural effects that are likely to 
involve alterations in dopaminergic function. The first evidence of this 
came from early animal studies, which reported similarities between the 
behavioural effects of THC, including catalepsy and hypothermia! , 
and dopamine antagonists'”’. However, THC also antagonizes the 
locomotor and hyperthermic effects of amphetamine, which causes 
dopamine release’. These seemingly paradoxical effects may be due 
to the high doses used in the early research together with the partial 
agonist effects of THC and/or non-specific receptor-independent 
effects. 

THC promotes increased food intake in animals 
Since appetite is modulated by the dopamine system 


109 0 


and humans!!°, 
1 studies have 
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investigated dopaminergic involvement in THC-induced feeding. The 
dopamine D, receptor antagonist SCH23390 attenuated THC-induced 
feeding at a dose that did not affect feeding on its own'’”. Recent 
research has identified CB,R-mediated changes in hypothalamic pro- 
opiomelanocortin (POMC) neurons!" as the potential underlying cause 
of this process, through mitochondrial uncoupling protein 2, which is 
involved in ghrelin-mediated dopaminergic function!*. 

Heavy cannabis use is associated with impaired educational and 
occupational outcomes'"». Factors that may underlie this association 
include cognitive impairment, which includes executive dysfunction", 
working memory impairments'!” and amotivation''® (defined as 
reduced motivation for goal-directed behaviour'!®). These functions 
are susceptible to mesocortical dopaminergic manipulation!”® (for 
example, prefrontal D; receptor blockade!”!). Although a preclini- 
cal study reported that D, receptor antagonism blocks THC-induced 
working memory deficits'”’, this was not replicated in humans!”*. 
Nonetheless, there is recent evidence that THC-induced working- 
memory deficits are moderated by catechol-O-methyltransferase, 
a key enzyme in the dopamine metabolic pathway’**, which also mod- 
ulates the effects of THC use on dopaminergic cell size in adolescents™. 
Chronic heavy cannabis use produces apathetic behaviour in rhesus 
monkeys!”9, while a study in humans!°? found that the reduced dopa- 
mine synthesis capacity observed in heavy cannabis users was related to 
amotivation. Overlapping features of the amotivational syndrome asso- 
ciated with cannabis use disorders, such as reduced reward sensitivity 
and negative emotionality!”°, were also found to be inversely related 
to methylphenidate-induced dopamine ventral striatal dopamine 
release®, 

The dopamine system is involved in risk of psychosis'~’. An early 
case report described increased striatal dopamine release following 
cannabis intoxication, associated with the exacerbation of psychotic 
symptoms in a patient with schizophrenia!**. Furthermore, cannabis 
users with a diagnosis of schizophrenia, and those at clinical high risk 
for schizophrenia, displayed blunted striatal stress-induced dopamine 
release'”®. Although dopamine release was blunted in cannabis users 
with schizophrenia, it was nevertheless directly related to the induc- 
tion of psychotic symptoms!*°. Supersensivity of postsynaptic D.Rs° 
could explain this apparent paradox; alternatively it could be due to 
impaired endocannabinoid regulation of dopamine signal transduction. 
Patients with schizophrenia using high levels of cannabis show reduced 
anandamide levels, supporting the latter explanation'*!. Accordingly, 
cannabidiol, a phytocannabinoid compound that elevates anandamide 
levels, has been shown to reduce psychotic symptoms!*”. Alternative 
postsynaptic mechanisms include CB,R-D)R heterodimerization'* and 
downstream intracellular mechanisms such as the neuregulin 1-erbB2 
receptor tyrosine kinase 4-phosphoinositide 3 kinase-protein kinase B 
(NRG1I-ERBB4-PI3K-AKT1) pathway. 
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Outlook 

There is now substantial evidence that THC exerts effects on the dopa- 
mine system in animals. The key challenges for the field must be to 
understand the complexity of these effects, identify how they translate to 
humans and appreciate how they relate to the potential negative effects 
of the drug in humans (Box 2). Animal studies demonstrate that the 
acute administration of THC causes region-specific increases in dopa- 
mine release and nerve activity—we must understand the functional 
significance of these increases. The available preclinical evidence sug- 
gests that chronic THC administration causes long-term changes to the 
dopamine system, but these models are limited as they do not reflect 
typical patterns of human use. Future studies should, therefore, be 
of longer duration to reflect human use and should also include 
co-administration with other drugs, such as nicotine and alcohol, 
which are commonly used with cannabis. Related to this is a need to 
understand how the dopaminergic effects of THC are modulated by 
the other phytocannabinoids. Human PET studies have demonstrated 
that dopamine synthesis and dopamine release are blunted in cannabis 
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BOX2 
Strategy for future work 


Preclinical and clinical research 


including modelling contingency in acute THC challenge studies 
Achieve consensus on dose equivalence across species 


changes 


reported between males and females’. 
Preclinical research 


Determine if the long-term effects of THC are reversible with abstinence 
Clinical research 


exposure 


dentify the relationships between cannabis-induced alterations in the dopamine system and behavioural phenomena in humans and animal models 
Use translational techniques that can be applied in human and animal studies alike and employ study designs that better reflect patterns of human use, 


Determine whether THC-induced dopaminergic changes during key developmental phases persist into later life and whether this is linked to behavioural 


nvestigate how gene variants that modulate the endocannabinoid and dopamine systems influence the sensitivity to the rewarding effects of THC and 
vulnerability to addiction, amotivation and psychosis following chronic exposure 
nvestigate the effects of sex in THC response. This will be useful to understand the differences in susceptibility to cannabis effects that have been 


dentify the effects on the dopamine system of long-term THC exposure alongside co-administration with nicotine to reflect typical patterns of human use 
Determine the mechanisms underlying the complex dose-response effects of THC on dopaminergic function 
Elucidate the mechanisms of regional differences in dopaminergic effects and the functional significance of these on behaviour 


Determine if the blunted dopamine release and synthesis seen in chronic users is a pre-existing vulnerability factor or a direct result of repeated THC 


Determine the dopaminergic changes over the course of repeated THC exposures and dose-response effects 
Achieve consensus on how to report previous exposure to cannabis use in the human literature 

Determine whether there are regional differences in dopamine release in response to THC administration in humans 
Determine the dose and context dependency of THC effects on the dopamine system 


users relative to non-users, yet we still need to understand the precise 
mechanisms through which this occurs. 

It is necessary to identify whether the effects of THC on the dopamine 
system are reversible and, if they are not, to recognize when they become 
so. Likewise, further molecular imaging studies in humans are needed to 
determine the dose-response and timing of the acute effects of THC on 
dopamine release, and to determine whether there are regional differ- 
ences in dopamine release—particularly between cortical and sub-cortical 
regions. 

Ultimately, one key challenge in understanding the effects of THC on 
the dopamine system is translating results from animal studies, many 
of which cannot be carried out clinically, into humans. We must also 
reach consensus on THC dose equivalence across species. It is therefore 
difficult to know what the implications of some preclinical animal work 
are for human research and how to back-translate human findings into 
preclinical models. There must be movement away from untranslatable, 
standalone research, towards the use of translational techniques, such 
as PET and MRI approaches that can be conducted in both humans and 
preclinical models that can capture human use patterns in combination 
with traditional preclinical techniques. We must also understand how 
psychoactive metabolites of THC and endocannabinoids modulate THC- 
induced changes in dopaminergic function. 

Given the changing patterns in cannabis use across the world, particu- 
larly in young people, and the increase in the THC content of cannabis, 
there is clearly a pressing need to understand how THC alters dopamin- 
ergic function. This is especially important given that emerging evidence 
that links dopaminergic alterations to a number of the adverse cognitive 
and behavioural consequences of THC, and that there is a dearth of current 
effective biological interventions for many of the psychiatric sequelae of 
cannabis use. Dopaminergic dysfunction may thus represent an area for 
future treatment targets. 

There is evidence that gestational exposure to THC is associated with 
dysregulated dopamine synthesis in later life. This has major potential 
public health implications, given the prevalence of cannabis use in women 
of child-bearing age and that the liberalization of cannabis laws around 
the world may be associated with increased use of the drug amongst 
gravid and nursing mothers. However, there remain gaps in our knowl- 
edge of the developmental effects of THC. In particular, we do not know 
how long the dopaminergic effects persist for and what the behavioural 
consequences are. Perhaps most critically, we do not yet know what the 
effects of THC exposure are on the adolescent dopamine system. 
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The available evidence indicates that THC exposure produces com- 
plex, diverse and potentially long-term effects on the dopamine system. 
These include increased nerve firing and dopamine release in response 
to acute THC, and dopaminergic blunting associated with long-term 
use. Future research should focus on probing the relationships between 
cannabis-induced alterations in the dopamine system and behavioural 
effects in humans and animal models. 
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Sleep is conserved from invertebrates to vertebrates, and is tightly regulated in a homeostatic manner. The molecular and 
cellular mechanisms that determine the amount of rapid eye movement sleep (REMS) and non-REMS (NREMS) remain 
unknown. Here we identify two dominant mutations that affect sleep and wakefulness by using an electroencephalogram/ 
electromyogram-based screen of randomly mutagenized mice. A splicing mutation in the Sik3 protein kinase gene 
causes a profound decrease in total wake time, owing to an increase in inherent sleep need. Sleep deprivation affects 
phosphorylation of regulatory sites on the kinase, suggesting a role for SIK3 in the homeostatic regulation of sleep amount. 
Sik3 orthologues also regulate sleep in fruitflies and roundworms. A missense, gain-of-function mutation in the sodium 
leak channel NALCN reduces the total amount and episode duration of REMS, apparently by increasing the excitability of 
REMS-inhibiting neurons. Our results substantiate the use of a forward-genetics approach for studying sleep behaviours 
in mice, and demonstrate the role of SIK3 and NALCN in regulating the amount of NREMS and REMS, respectively. 


Sleep is an animal behaviour ubiquitously conserved from vertebrates 
to invertebrates, including flies and nematodes‘, and is tightly regu- 
lated in a homeostatic manner. Sleep in mammals exhibits the cycles of 
REMS and NREMS that are defined by characteristic activity of elec- 
troencephalogram (EEG) and electromyogram (EMG). Time spent 
in sleep is determined by a homeostatic sleep need, a driving force 
for sleep/wakefulness switching, which increases during wakefulness 
and dissipates during sleep*°. The spectral power in the delta-range 
frequency (1-4 Hz) of EEG during NREMS has been regarded as one 
of best markers for the current level of sleep need. Conversely, the 
level of arousal is positively correlated with sleep latency, which can be 
regulated independently of sleep need’, reflecting the overall activity 
of wake-promoting neurons. Traditional approaches to locate the 
neural circuits regulating sleep and wakefulness behaviour included 
local ablation of brain regions’~®. Recent advances in optogenetic and 
chemogenetic research have directly demonstrated that switching 
between sleep and wake states is executed by subsets of neurons in 
the basal forebrain!”, lateral hypothalamus!!!? and locus coeruleus!’, 
and that switching between NREMS and REMS is executed by a neu- 
ral network in the pons and medulla'*'’. Despite the accumulating 
information about executive neural circuitries regulating sleep and wake 
states, the molecular and cellular mechanisms that determine the pro- 
pensity of switching between wakefulness, REMS and NREMS remain 
unknown. To tackle this problem, we have used a phenotype-driven 
forward-genetics approach that is free from specific working 
hypotheses’®. Previously, a series of forward-genetics studies using flies 


and mice successfully uncovered the molecular network of the core 
clock genes regulating circadian behaviours!”~””. Sleep-regulating genes 
were also discovered through the screening of mutagenized flies”. 
However, genetic studies for sleep using mice has been challenging 
because of the effective compensation and redundancy in the regulation 
of sleep and wakefulness, and the need for EEG and EMG monitoring 
of the staging of wakefulness, NREMS and REMS. 


Sik3 splice mutation increases NREMS 

We induced random point mutations into C57BL/6] (B6J) male 
mice (Go) by ethylnitrosourea (ENU) and screened more than 8,000 
heterozygous B6J x C57BL/6N (B6N) F mice for dominant sleep and 
wakefulness abnormalities through EEG/EMG-based sleep staging 
(Extended Data Fig. 1a). BON was chosen as a counter strain because its 
sleep and wakefulness parameters are highly similar to B6J (Extended 
Data Fig. 1b), and the entire list of single nucleotide polymorphisms 
has recently become available”°. 

Through our screening, we established a mutant pedigree, which 
we termed Sleepy (Slp), with a markedly prolonged sleep time. Five 
founders of the Sleepy mutant pedigree were born by in vitro fertili- 
zation using sperm from the same ENU-treated Gp male, and showed 
a daily wake time (524 + 19.7 min; mean +s.d.) that was shorter than 
the mean of all mice screened by >3 standard deviations (Extended 
Data Fig. 1c). The Sleepy pedigree showed clear dominant inheritance 
of reduced wake time (Fig. 1a). Linkage analysis in the B6J x BON N2 
generation of five Sleepy pedigrees (B021-B025) produced a single log 


Unternational Institute for Integrative Sleep Medicine (WPI-IIIS), University of Tsukuba, Tsukuba, Ibaraki 305-8575, Japan. @Department of Anatomy, Faculty of Medicine, Toho University, Ota-ku, 
Tokyo 143-8540, Japan. Department of Molecular Genetics, University of Texas Southwestern Medical Center, Dallas, Texas 75390, USA. “Department of Neuropharmacology, Graduate School 

of Pharmaceutical Sciences, Nagoya City University, Nagoya, Aichi 467-8603, Japan. SLaboratory Animal Resource Center, University of Tsukuba, Tsukuba, Ibaraki 305-8575, Japan. "Department 
of Neuroscience, University of Texas Southwestern Medical Center, Dallas, Texas 75390, USA. 7The Jackson Laboratory, Bar Harbor, Maine 04609, USA. ®Technology and Development Team for 
Mouse Phenotype Analysis, RIKEN Bioresource Center, Tsukuba, Ibaraki 305-0074, Japan. 2Laboratory of Research Advancement, National Center for Geriatrics and Gerontology, Obu, Aichi 474- 
8511, Japan. !°Department of Cellular Neurobiology, Brain Research Institute, Niigata University, Niigata 951-8585, Japan. !'PRESTO, Japan Science and Technology Agency, Kawaguchi, Saitama 
332-0012, Japan. !*Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, Texas 75390, USA. !3Howard Hughes Medical Institute, University of Texas Southwestern 
Medical Center, Dallas, Texas 75390, USA. !Life Science Center, Tsukuba Advanced Research Alliance, University of Tsukuba, Tsukuba, Ibaraki 305-8575, Japan. 


*These authors contributed equally to this work. 


00 MONTH 2016 | VOL 000 | NATURE | 1 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 


a 
B023 L1,200 
144 
a 
124 L1,000 2 
s 
$104 Lsoo 
3 g 
5 84 3 
8 +600 5 
5 * 400 8 
Z 41 a 
ro) 
= 
24 200 & 
1 3 5 Tt 9 11.13 15 17 19 
Chromosome 
450 500 550 600 650 700 750 800 850 900 950 
c Time spent in wake (min per 24 h) Gene Olfr918 Olfr959 | Pou2f3 | Pou2f3 Sik3 
Chromosome 9 B6N/B6N oO B6J/B6N i | GRCm38 38673080 |39572620| 43147266 |43147276) 46198712 
6.2 Mb rs33672598 MIM BOOOO OOO HOG Base change] AtoG AtoG | AtoC | AtoC GtoA 
31.0 Mb rs13480122 I BO sa OOOO Function |Synonymous| L to P Intron | Intron |Splice donor 
52.6 Mb rs2oe44859 OM PMB: BOOOOO  sozsso104 ° ° 
85.1 Mb rs260373537 TO PPBOO BRSOOOO  soessot05 ° ° 
Bo212 7 2616 0 0 3.11325 4 00 B023#0201 o °o ° 
Bo225 318 4 4 01 3.4 161100 B023#0202 [e) fe) fe) 
cmesfousa mss) 0 484298 Sean) —o [6 a: 
Bo251 6 4216 0 0 3 4275611 — 9 9 o 
Total frequency .1 .15 .6 .05 .1 .01.01 MAS 26 100204 ha 
Sleepy”’* Sleepy*/* B6N#02 
; ' ; f Sik3*/* SikgSP/* SikgSb/Sio 
: wile nim 
Sucks 7 cs ee GAT A G oa — Ctx Hypo Liver Ctx Hypo Liver Ctx Hypo Liver 


AWWW 


9g ia $551 ; 
Sie iB Bh ke 
145 ———— sie m Sika" 
SIK3(SLP) nN eno 
140 —_z ———T z 
= © 5004 
h Brain homogenate i —£ 
o 4004 
+/+ Sip/+ — Sip/Sip Sik3¥+ Sik3SIPZFN/+ £ aaa wie 
kDa Ctx Hypo Liver Ctx Hypo Liver g 
150- = -_ 8 2004 
140- = 100 
0 
24h I Dark 


Figure 1 | Identification of Sik3 splicing mutation responsible for 
reduced total wake time. a, Wake-time distribution of B023 N; littermates 
(bars) and all mice screened (curve). Blue and purple bars indicates 
retrospectively genotyped Sik3*/* and Sik3‘”'* mice, respectively. 

b, Quantitative trait locus (QTL) analysis of B023 pedigree (n = 93) for 
total wake time. Inset shows LOD score peak between rs13480122 and 
rs29644859. c, Haplotype analysis of Sleepy mutant pedigrees, B021-B025, 
in terms of the presence of Sleepy phenotypes. d, Exome sequencing 
results from Sleepy mutant mice of B023 and B024 pedigrees together 

with wild-type mice within the region of the LOD score peak. e, Direct 


odds ratio (LOD) score peak on chromosome 9 (Fig. 1b and Extended 
Data Fig. 2a), between rs13480122 (chr9: 31156626) and rs29644859 
(chr9: 52785119) (Fig. 1b, c). Whole-exome sequencing of Sleepy 
mutants identified a heterozygous single nucleotide substitution at 
the splice donor site (chr9: 46198712) for intron 13 of the Sik3 gene 
(Fig. 1d, e). The mutation predicted an abnormal skipping of exon 
13 (Sik35""'+), which was confirmed by reverse transcription PCR 
(RT-PCR) and sequencing of the Sik3 mRNA (Fig. 1f, Supplementary 
Fig. 1 and Extended Data Fig. 2b). 

SIK3 is a protein kinase that is broadly expressed in neurons of the 
cerebral cortex, thalamus, hypothalamus and brainstem (Extended Data 
Fig. 2c, d), and belongs to the AMP-activated protein kinase (AMPK) 
family. SIK3 has a serine-threonine kinase domain at the N terminus 
and a protein kinase A (PKA) recognition site (Ser551) in the middle 
portion”! (Fig. 1g). The skipping of exon 13 results in an in-frame 
deletion of 52 amino acids, encompassing the PKA site (Fig. 1g, 
Extended Data Fig. 2e and Supplementary Fig. 1). Immunoblotting of 
brain homogenates from Sik35?'* and Sik35”S? mice using anti-SIK3 
antibody detected smaller SIK3 proteins as predicted (Fig. 1g, h and 
Supplementary Fig. 1). The nature of this mutation, taken together 
with our results on SIK3 orthologues in Drosophila melanogaster 
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sequencing of the Sik3 gene. f, RT-PCR of Sik3 mRNA produced smaller 
bands specific to Sik3°? mice. Ctx, cerebral cortex; Hypo, hypothalamus. 
g, Structures of wild-type and mutant SIK3 proteins. h, Immunoblotting 
of brain homogenates showing wild-type (+/+) and mutant SIK3 (Slp/+ 
and Slp/Sip) protein variants. i, RT-PCR of brain mRNA from ZFN-based 
Sik3°”!+ mice showing smaller bands lacking exon 13, in addition to larger 
bands containing the exon 13. j, Sik35?(@N/* mice (n =15) showed a 
shorter total wake time than wild-type littermates (n = 14). ***P< 0.001, 
two-way analysis of variance (ANOVA) followed by Tukey’s test. Data are 
mean = s.e.m. 


and Caenorhabditis elegans (see below), suggests that Sik3°” is a 
gain-of-function allele. 

To confirm genetically that the Sik3 splice mutation is the sole cause 
of the long-sleep phenotype, we introduced the Sik3 exon 13-skipping 
allele into wild-type mice by using either conventional knock-in in 
embryonic stem (ES) cells or zinc-finger nuclease (ZFN) technology 
(Fig. 1i, Extended Data Fig. 3b and Supplementary Fig. 1). As expected, 
these mouse lines exhibited markedly reduced wake time (Fig. 1), 
Extended Data Fig. 3c), similar to the original Sleepy pedigree. Thus, 
the lack of exon 13 of the Sik3 gene causes the Sleepy phenotype. 


Increased sleep need in Sleepy mutants 

No overt abnormality in the sleep EEG/EMG was seen in the Sleepy 
mutants (Extended Data Fig. 4a). Detailed examination of sleep 
and wakefulness behaviour of Sleepy mutants showed that, although 
Sik3°”/+ mice exhibit reduced wake time and increased NREMS time 
both in the light and dark phases, the hypersomnia phenotype is more 
pronounced in the dark phase (Fig. 2a, b), possibly owing to a ceiling 
effect in the light phase. Homozygous Sik3°/S? mice had an even 
shorter total wake time and longer NREMS time than Sik3°”’* mice 
(Fig. 2a, b and Extended Data Fig. 4b-d), showing an allele dosage 
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Figure 2 | Increased sleep need and normal wake-promoting response 
of Sik3 mutant mice. a, b, Circadian variation in wakefulness (a) and 
NREMS (b) in Sik3*/* (n = 22), Sik3°?!*+ (n= 32) and Sik35?/S? (n =31) 
mice. *P < 0.05 (black); *P < 0.001 (red), one-way repeated measures 
ANOVA followed by Tukey’s test. c, Time spent in wakefulness from 
zeitgeber time (ZT)4 to ZT5 after cage change at ZT'5 of Sik3*/* (n=6), 
Sik35?/* (n= 9) and Sik35?S? (n = 6) mice. ***P < 0.001, one-way 
repeated measures ANOVA followed by Tukey’s test. d, Wake time for 
6h after caffeine injection at ZTO in Sik3+/+ (n=6), Sik35”/* (n =6) and 
Sik35!?/S? (n = 6) mice. *P < 0.05 (black); ##P < 0.01 versus 10mg kg™! 
body weight (BW); *P < 0.01 (red) versus 15 mg kg~! BW, two-way 
ANOVA followed by Tukey’s test. e, NREMS delta density of Sik3 mutant 
mice (Sik3t/*, n= 22; Sik3°”!*, n = 32; Sik35”/”, n = 31) across the light- 
dark cycle. *P < 0.05; ***P < 0.001, one-way repeated ANOVA followed 
by Tukey’s test. f, Increase in NREMS delta power of Sik3*/+ (n=7), 
Sik35?/+ (n=7) and Sik3°/S¥ (n = 10) mice after 6-h sleep deprivation. 
*P < 0.05, one-way ANOVA followed by Tukey’s test. g, Increase in 
NREMS delta power after 2, 4 and 6h of sleep deprivation of Sik3*/* 
(n=11) and Sik3°”'+ (n= 11) mice relative to NREMS delta power of the 
same ZT during basal sleep. *P < 0.05; **P < 0.01; ***P < 0.001, two-way 
ANOVA followed by Tukey’s test. h, Phosphorylation of Flag-SIK3 of Flag- 
Sik3*'* brains with or without 4-h sleep deprivation. *P < 0.05, two-way 
ANOVA followed by Tukey’s test. Data are mean + s.e.m. 


effect. Total REMS time was similar among genotype groups (Extended 
Data Fig. 4e). Sik35”/* and Sik3°”’5? mice had a higher NREMS/total 
sleep ratio (Extended Data Fig. 4f) and a lower REMS/total sleep ratio 
(Extended Data Fig. 4g), indicating a NREMS-specific change in the 
Sleepy mutant mice. Longer time spent in REMS during the dark phase 
seems to be secondary to the increased sleep amount and be compen- 
sated with shorter REMS time during the light phase (Extended Data 
Fig. 4h). Female Sik35”'* and Sik35”S” mice also exhibited a similar 
phenotype (Extended Data Fig. 4i). 

We next examined whether the short wake time of Sleepy mice is 
due to a defective wake-promoting response to behavioural or phar- 
macological stimuli. A new cage environment strongly mobilizes 
the wake-promoting system in mice®. Both Sik3°”/+ and Sik35?/S? 
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mice exhibited prominent wake responses to cage change, similar to 
wild-type littermates during the light phase (Fig. 2c) and dark phase 
(Extended Data Fig. 5a). Sik35"”'*+ and Sik35'/S? mice also showed 
increased wake time in response to the administration of caffeine 
(Fig. 2d) or modafinil (Extended Data Fig. 5b), to similar or even 
slightly higher degrees than Sik3+/* littermates. We then examined 
the behavioural circadian rhythm of the Sleepy mutants. The circadian 
period length as assessed by wheel-running behaviour under constant 
darkness was similar among genotype groups (Extended Data Fig. 5c). 
Sleepy mutant mice showed a robust reduction in wake time also under 
constant darkness, similar to light-dark conditions (Extended Data 
Fig. 5d). 

Given the apparently normal wake-promoting and circadian systems, 
we then proposed that the Sleepy mutants have an inherently higher 
sleep need. Indeed, Sleepy mutant mice exhibited a higher density of 
slow-wave activity during NREMS in an allele-dosage dependent fash- 
ion (Fig. 2e), suggesting that the baseline sleep need is increased in the 
mutants. Lower-frequency power in EEG was also increased during 
wakefulness in Sleepy mutant mice (Extended Data Fig. 5e), which may 
be associated with local cortical synchronization owing to increased 
sleep need”. Furthermore, 6h of sleep deprivation from the onset of 
the light phase increased the NREMS delta power of Sik3°” mice to a 
larger extent than Sik3+!*+ mice (Fig. 2f). To examine dose-dependent 
effects, we then conducted 2, 4 and 6h of sleep deprivation. Under our 
conditions, Sik3*’* mice showed only a slight increase after 2h of sleep 
deprivation, whereas Sik3°”’* mice exhibited a marked dose-dependent 
increase in the delta power (Fig. 2g and Extended Data Fig. 5f), demon- 
strating their exaggerated response to sleep deprivation. 

If SIK3 constitutes a part of the enigmatic molecular pathway deter- 
mining the level of homeostatic sleep need, then the protein should be 
modulated by sleep deprivation. Because the function of SIK3 is regu- 
lated by its own phosphorylation”’, we examined its phosphorylation 
status in the brain from mice in which a Flag-tag was inserted into the 
N terminus of SIK3 using CRISPR/Cas9 technology (Extended Data 
Fig. 6a). Immunoblotting and immunoprecipitation with anti-SIK3 
and anti-Flag antibodies detected Flag-SIK3 protein in the brain as 
expected (Extended Data Fig. 6b-d and Supplementary Fig. 1). To 
examine whether sleep deprivation affects the phosphorylation status 
of SIK3, brains of Flag-Sik3 mice were obtained after sleep depriva- 
tion. Quantitative phosphoproteomic analysis of immunopurified Flag- 
tagged proteins showed that sleep deprivation specifically increases 
the phosphorylation of residue Thr221, which is closely associated 
with the kinase activity of SIK3 (ref. 23; Fig. 2h). In the same way, we 
introduced a Flag sequence into the Sik3°? allele, and obtained mice 
expressing mutant Flag—SIK3(SLP) protein (Extended Data Fig. 6b, c 
and Supplementary Fig. 1). In addition to the expected decrease in 
the phosphorylation of Ser551, Flag—SIK3(SLP) in the brains showed 
a decrease in the phosphorylation of Thr469, another PKA site’4, and 
an increase in the phosphorylation of Ser914 compared with Flag-Sik3 
mice under ad libitum sleep during the light phase (Extended Data 
Fig. 5g). After sleep deprivation, these phosphorylation changes were 
sustained and a significant increase in the phosphorylation of Ser674, 
another PKA site**, was recognized (Extended Data Fig. 5g). Thus, 
the lack of the Ser551-containing region disturbs the phosphorylation 
status of other PKA sites in a complex fashion, suggesting a mechanistic 
link with the increased sleep need of Sleepy mutant mice. 


Conserved role of SIK3 in invertebrates 

The exon 13-encoded region of Sik3 is highly conserved among verte- 
brate animals (Fig. 3a). Notably, there is a PKA recognition site within 
the exon 13-encoded region, which is conserved even in Sik3 ortho- 
logues of Drosophila and C. elegans (Fig. 3a and Extended Data Fig. 7). 
To examine the role of Sik3 orthologues in sleep-like behaviours in 
Drosophila, we expressed a phosphorylation-defective SIK3 (Ser563 Ala; 
serine residue equivalent to Ser551 in mouse SIK3) in neuronal cells in 
an RU486-inducible fashion. Daily sleep time of adult flies is increased 
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Figure 3 | Role of Sik3 orthologues in invertebrate sleep-like behaviours. 
a, The phylogenetic conservation of the exon 13-encoded region of Sik3. 
b, c, Sleep time before and after induction of SIK3(Ser563Ala) by RU486 
under constant darkness (32 per group). d, Sleep time of control and 

Sik3 hypomorphic mutant (Sik3(hypo)) in 12-h light:12-h dark (L-D) 
conditions (top, 16 per group, P< 0.001) and in constant darkness (D-D) 
(bottom, 16 per group, P< 0.001); one-way repeated measures ANOVA. 
CT, circadian time. e, Sik3-null mutant worms, kin-29(o0y38) (n= 15), 
exhibited reduced quiescence during lethargus compared with wild-type 
worms (n = 10). kin-29(oy38);Py29::kin-29 worms (n= 9), in which wild- 
type kin-29 was expressed in neuronal cells, restored normal quiescence 
during lethargus. The fractions of quiescence out of lethargus were 
similar (P= 0.98). *P < 0.05; **P < 0.01; ***P < 0.001, one-way repeated 
measures ANOVA followed by Tukey’s test. Data are mean + s.e.m. 


after induction of SIK3(Ser563Ala) (Fig. 3b, c). Conversely, flies with 
a hypomorphic Sik3 mutation showed reduced sleep time in the 
light-dark cycle and in constant darkness (Fig. 3d). Furthermore, a 
null mutation of kin-29, the C. elegans orthologue of Sik3, reduced the 
fraction of quiescence during L4-adult lethargus, a sleep-like state in 
C. elegans? (Fig. 3e). Pan-neuronal expression of kin-29 rescued this 
phenotype. Thus, orthologues of Sik3 also seem to be involved in the 
regulation of sleep amount in fruitflies and nematodes. 


Nalcn mutation reduces REMS 

The EEG/EMG-based dominant screening of ENU-mutagenized 
mice also yielded a mutant pedigree with a REMS abnormality, which 
we termed Dreamless (Drl). The founder of the pedigree showed a 
short REMS episode duration (Fig. 4a), that was heritable in the off- 
spring (Extended Data Fig. 8a). Linkage analysis in the B6J x BON 
N> generation showed a single LOD score peak near rs31233932 
(chr14: 124108797) on chromosome 14 (Fig. 4b and Extended Data 
Fig. 8b). Whole-exome sequencing combined with direct sequencing 
of candidate genes identified a heterozygous missense mutation in 
the Nalcn gene (chr14: 123515403) as the only functionally relevant 
mutation within the mapped region in Dreamless mutants (Fig. 4c 
and Extended Data Fig. 8c). We then confirmed the causal relation- 
ship of the Nalcn gene mutation to the REMS phenotype by intro- 
ducing the same nucleotide substitution in wild-type mice using 
the CRISPR/Cas9 system. CRISPR Nalcn?"'*+ mice showed a short 
REMS episode duration, similar to the original Dreamless pedigree 
(Fig. 4d-g). 
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Figure 4 | Missense mutation in Nalcn gene reduces REMS time and 
episode duration. a, Histogram of REMS episode duration of G; mice 
screened (mean +s.d. = 1.41 +0.19 min). Arrow indicates the founder of 
Dreamless mutant pedigree. b, QTL analysis of Dreamless mutant pedigree 
(n= 56) for REMS episode duration. Inset shows LOD score peak near 
rs31233932. c, Direct sequencing of the Nalcn gene. d, e, Total REMS 

time (d) and REMS episode duration (e) of Nalcn?™*+ mice (n=29) and 
Nalcn*'* mice (n= 25) of the Dreamless mutant pedigree. f, g, Total REMS 
time (f) and REMS episode duration (g) of Nalcn?”'*+ mice (n= 11) and 
Nalcn*'* mice (n= 17) produced by CRISPR/Cas9 technology. 

** D < 0.001, two-tailed Student’s t-test. Data are mean + s.e.m. 


Dreamless mutants showed theta (6-9 Hz)-dominant EEG and 
appropriate muscle atonia during REMS, and did not show any overt 
abnormality in the EEG/EMG (Extended Data Fig. 9a). Nalen?"* mice 
showed a reduced total REMS time owing to a short average REMS 
episode duration (Fig. 4d—g and Extended Data Fig. 9b, c). In other 
words, Nalen>"’* mice failed to maintain the REMS state properly, 
resulting in highly unstable REMS. The total time spent in the wake 
and NREMS states of Nalcn?"’* mice was similar to that of Nalcn*/+ 
mice (Extended Data Fig. 9d). Nalcn?"*+ mice had normal circadian 
period lengths but showed a greatly reduced amplitude of behavioural 
circadian rhythms under constant darkness (Extended Data Fig. 9e, f), 
consistent with the recently reported involvement of NALCN in the 
circadian regulation of neuronal excitability in the suprachiasmatic 
nucleus”®. A marked reduction in the REMS time of Nalcn?"!*+ mice 
was also observed under constant darkness (Extended Data Fig. 9g). 
Spectral analysis of EEG showed a decrease in the theta-range power 
during NREMS and REMS and an increase in low-frequency power 
during the wake state and REMS in Nalcn?"!+ mice (Extended Data 
Fig. 9h), suggesting the possibility that NALCN may regulate various 
oscillations in the brain. 

NALCN is a voltage-independent, non-selective cation channel with 
a proposed role in the control of neuronal excitability”®. It is highly 
expressed in several brainstem nuclei involved in REMS regulation, 
such as the ventrolateral periaqueductal grey, deep mesencephalic 
nucleus (DpMe) and sublateral dorsal nucleus””!+!> (Extended Data 
Fig. 10a—c). The mutation results in an Asn315Lys substitution in helix 
S6 of domain I (Fig. 5a), which is conserved among vertebrates and 
invertebrates (Fig. 5b). 

To examine whether the mutation changes the electrophysiological 
properties of NALCN, we made patch-clamp recordings from HEK293 
cells cotransfected with NALCN or the NALCN(DRL) mutant, together 
with UNC80 and SRC(Tyr529Phe), which allow constitutive activation of 
NALCN?”?8, Both NALCN and NALCN(DRL) showed linear current- 
voltage relationships (Fig. 5c, d and Extended Data Fig. 10d, e), 
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Figure 5 | Dreamless mutation in Nalcn gene increases excitability 

of neurons in the ‘REM-off’ area. a, Schematic structure of NALCN 
protein. b, Phylogenetic conservation of Asn315 residue in NALCN. 

c, Representative traces of membrane currents in response to 300-ms step 
pulses ranging from —80 mV to +80 mV in 10-mV increments (holding 
potential V;, =0 mV, bottom) recorded from HEK293T cells transfected 
with wild-type NALCN (top) or mutant NALCN(DRL) (middle). 

d, Mean current-voltage (I-V) curves in wild-type NALCN (n=5, black 
circles and bottom right) or NALCN(DRL) (n =7, purple circles). 

e, The conductance of NALCN(DRL)-transfected cells was larger than 
that of wild-type NALCN-transfected cells (NALCN, 0.09 + 0.02 nS pF =. 
n=5; NALCN(DRL), 1.81 + 0.62 nS pF, n=7). **P<0.01, Mann- 
Whitney U test. f, Representative trace of membrane potentials of DpMe 
neurons in Nalcn*'+ (top) and Nalcn?™'*+ (bottom) mice. Dashed lines 
indicate 0 mV level. g,h, Mean membrane potentials (g) and spontaneous 
firing rates (h) of DpMe neurons (Nalcn*!*, n= 33; Nalen?"+, n=31). 
*P <0.05, Mann-Whitney U test. Data are mean + s.e.m. 


with similar equilibrium potentials (NALCN, —2.1+1.6mV, n=7; 
NALCN(DRL), —1.2 £0.7 mV, n=5, P=0.94, Mann-Whitney U test). 
However, the ionic conductance of NALCN(DRL) was much larger 
than that of NALCN (Fig. 5e). Similarly, the charge transfer of inward 
and outward currents in NALCN(DRL)-transfected cells was mark- 
edly higher than in NALCN-transfected cells (Extended Data Fig. 10f), 
raising the possibility that NALCN(DRL) may increase the intrinsic 
excitability of REMS-regulatory neurons. To test this possibility, whole- 
cell current-clamp recordings were made from neurons in the DpMe 
(Extended Data Fig. 10a), which contains ‘“REM-off’ cells!*?97°, using 
brain slices from NalcnP™* and Nalcn*'+ mice. The DpMe neurons in 
Nalcn?"'* slices exhibited depolarization (Nalcn*!*, —65.6+1.7 mV, 
n= 33 cells; Nalcn?™+, —57.8+2.0 mV, n=31 cells; P=0.01, Mann- 
Whitney U test) and higher spontaneous firing rates (Nalcn*'t, 
0.4 + 2.3 spikes s-!,n=33 cells; Nalcn?"*+, 2.3+0.8 spikes s-in=31 
cells; P=0.03, Mann-Whitney U test) compared with Nalcn*’* slices 
(Fig. 5f-h). 


Discussion 

Our EEG/EMG-based forward-genetics screen in mice has identified 
distinct sleep phenotypes and mutated genes. Yet our study also illus- 
trates a conserved role of SIK3 in the sleep behaviour of vertebrates 
and invertebrates. We think that the Sleepy mutation in SIK3 increases 
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the animal's intrinsic sleep need, because Sleepy mutant mice exhibit 
(1) a higher density of slow-wave activity, a reliable index of home- 
ostatic sleep need; (2) a larger increase in NREMS delta power after 
sleep deprivation; and (3) a normal waking response to behavioural 
or pharmacological arousal stimuli. Furthermore, we found that the 
functionally relevant phosphorylation status of SIK3 is modulated by 
sleep deprivation, which induces an acute increase of sleep need. Taken 
together with the finding that the Sleepy mutation markedly increases 
the baseline NREMS amounts in an allele dosage-dependent fashion, 
we propose that SIK3 functions in the intracellular signalling pathway 
that dictates sleep need and homeostatically regulates the daily amount 
of sleep. 

We propose that NALCN works in the neuronal groups regulating 
REMS*!*}5 for the maintenance and termination of REMS episodes. 
The narrow abdomen mutant, carrying a loss-of-function mutation in 
the Drosophila orthologue of NALCN, exhibits increased sensitivity 
to anaesthesia*! and abnormal circadian behaviour® in part through 
a blunted circadian change in neuronal excitability”. The crucial role 
of the narrow abdomen in enhancing bi-stability between wakeful- 
ness and the anaesthetized state, as well as between wakefulness and 
sleep*’, is consistent with de-stabilized REMS episodes of the Dreamless 
mutant mice. Thus, like the case of SIK3 above, NALCN orthologues 
also appear to have conceptually analogous roles in regulating sleep- 
related behaviours in both mice and fruitflies. Our results argue for the 
utility of unbiased forward-genetics screens for the discovery of genes, 
alleles and pathways regulating sleep in mammals. 

Note added in proof: A related review article describing the 
neural circuitries controlling sleep and wake states can be found in 
ref. 34. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Animals and mutagenesis. Male C57BL/6J mice (CLEA Japan) were treated with 
ethylnitrosourea (85 mg kg” ', Sigma-Aldrich) by intraperitoneal injection twice at 
weekly intervals at the age of 8 weeks. At the age of 25-30 weeks, the sperm of the 
mice was used for in vitro fertilization with eggs of C57BL/6N mice to obtain F, 
offspring. Mice were provided food and water ad libitum, and were maintained on a 
12-h light:12-h dark cycle and housed under controlled temperature and humidity 
conditions. All procedures were approved by the Institutional Animal Care and 
Use Committee of the University of Tsukuba and the RIKEN BioResource Center, 
University of Texas Southwestern Medical Center at Dallas. 

Surgery. EEG/EMG electrode implantation was performed as described 
previously*?, with isoflurane (3% for induction, 1% for maintenance) used for 
anaesthesia. Seven days after surgery, the mice were tethered to a counterbalanced 
arm (Instech Laboratories) that allowed free movement and exerted minimal 
weight. 

Screening scheme. At the age of 12 weeks, male mice were implanted with EEG/EMG 
electrodes and then screened for sleep/wakefulness behaviour. Examined para- 
meters were total time spent in wake, NREMS and REMS states, episode duration 
of wake, NREMS and REMS states, appearance of muscle atonia during REMS, and 
rebound sleep after 4-h sleep deprivation by shaking the cages. For quantitative 
parameters, we selected mice whose phenotypes deviated from the average by 
at least 3 standard deviations. After confirming the reproducibility of the sleep 
phenotype, the mice were selected for offspring production by natural mating or 
IVF with wild-type females to examine the heritability of the sleep phenotypes. If 
at least 30% of the male littermates showed sleep phenotypes similar to their father, 
we considered the sleep abnormalities to be heritable. 

Linkage analysis. Single nucleotide polymorphisms of Nz mice were determined 
using a custom TaqMan Genotyping assay (Thermo Fisher). The custom probes 
were designed based on the polymorphism data between C57BL/6] and C57BL/6N 
(ref. 20). QTL analysis was performed using J/qtl software (Jackson Laboratory). 
Whole-exome sequencing. Whole exomes were captured with SureSelectXT2 
Mouse All Exon (Agilent) and processed to a paired end 2 x 100-bp run on the 
Illumina HiSeq2000 platform at the UTSW McDermott Center Next Generation 
Sequencing Core. Reads were mapped to the University of California Santa Cruz 
mm9 genome reference sequence for C57BL/6J using Burrows- Wheeler aligner 
and quality filtered using SAMtools. Cleaned BAM files were used to realign data 
and call variants using the Genome Analysis ToolKit to detect heterozygous muta- 
tions. 

Sleep behaviour analysis. The recording room was kept under 12-h light:12-h dark 
cycles and a constant temperature (24-25 °C). To examine sleep-wake behaviour 
under baseline conditions, EEG/EMG signals were recorded for two consecutive 
days from the onset of the light phase. EEG/EMG data were visualized and ana- 
lysed using a MatLab (MathWorks)-based, custom semi-automated staging pro- 
gram followed by visual inspection. EEG signals were subjected to fast Fourier 
transform analysis from 1 to 30 Hz with 1-Hz bin using MatLab-based custom 
software. Epochs containing movement artefacts were included in the state totals 
but excluded from subsequent spectral analysis. Sleep/wakefulness was staged into 
wakefulness, NREMS and REMS. Wakefulness was scored based on the presence 
of low amplitude, fast EEG, and high amplitude, variable EMG. NREMS was char- 
acterized by high amplitude, delta (1-4 Hz) frequency EEG and low EMG tonus, 
whereas REMS was staged based on theta (6-9 Hz)-dominant EEG and EMG atonia. 
Hourly delta density during NREMS indicates hourly averages of delta density 
which is the ratio of delta power to total EEG power at each 20-s epoch. For the 
power spectrum of sleep/wakefulness, the EEG power of each frequency bins was 
expressed as a percentage of the total EEG power over all frequency bins (1-30 Hz) 
and sleep/wakefulness states**°. For sleep deprivation, mice were sleep deprived 
for 2, 4 and 6h from the onset of the light phase by gently touching the cages when 
they started to recline and lower their heads. Food and water were available. To 
evaluate the effect of sleep deprivation, the NREMS delta power during the first 
hour after sleep deprivation was expressed relative to the same zeitgeber time (ZT) 
of the basal recording or relative to the mean of the basal recording. For caffeine 
and modafinil injection experiments, mice were fully acclimatized for intraperito- 
neal injection before sleep recording. After 24-h baseline recording, mice received 
caffeine (Sigma), modafinil (Sigma) or vehicle (0.5% methyl cellulose (Wako)) 
intraperitoneally at ZTO, followed by 12-h recording. Injections were delivered 
once per week, with each injection followed by a 6-8-day washout period, during 
which mice remained in the recording chamber. To examine the sleep/wakefulness 
behaviour under constant darkness, after 48-h recording under a 12-h light:12-h 
dark cycle, EEG/EMG recording continued in constant darkness for 3 days. 
Circadian behaviour analysis. Mice were housed individually in a cage (width 
23cm, length 33 cm, height 14cm) containing a wireless running wheel (Med 
Associate ENV-044). Cages were placed in a light-tight chamber equipped with 
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green LED light (100 Ix at the bottom of the cage). The rotation numbers of wheels 
were obtained with 1-min bin using Wheel manager software (Med Associate). 
Mice were entrained to 12-h light:12-h dark cycle for 7 days, and then released 
into constant darkness for 3 weeks. The free running period was calculated with 
linear regression analysis of activity onset using MatLab-based custom software. 
Circadian activity amplitude was calculated by fast Fourier transform of activity 
data, which were processed with Bartlett window using MatLab-based custom 
software. Relative amplitude was normalized to the mean amplitude of the wild- 
type group. 

Western blot. A rabbit polyclonal antibody against the C-terminal 171 amino 
acids of mouse SIK3 was generated using custom antibody production service 
(Pacific Immunology). Tissues were homogenized using a rotor-stator homoge- 
nizer (Polytron) in ice-cold lysis buffer (20 mM HEPES, pH 7.5, 100mM NaCl, 
10mM NayP2O7, 1.5% Triton X-100,15mM NaF, 1x PhosSTOP (Roche), 5mM 
EDTA, 1 x protease inhibitor (Roche)), and then centrifuged at 13,000g at 4°C. The 
supernatants were separated by SDS-PAGE and transferred on PVDF membrane. 
Western blotting was performed according to standard protocols. 

In situ hybridization. In situ hybridization was performed as described 
previously*”. In brief, a 0.7-0.8-kb fragment of Nalcn cDNA was inserted into 
pGEM-T easy (Promega) and used for DIG-labelled probe synthesis. Mice were 
deeply anaesthetized with sodium pentobarbital and perfused transcardially with 
PBS followed by 4% paraformaldehyde (PFA). Forty-micrometre-thick brain 
sections were treated with 0.3% Triton X-100, digested with 1 :g ml’ proteinase 
K, treated with 0.75% glycine, and then treated with 0.25% acetic anhydride in 
0.1M triethanolamine. After overnight incubation with a digoxigenin (DIG)- 
labelled probe at 60°C, the sections were washed and then incubated with alkaline 
phosphatase-conjugated anti-DIG Fab fragments (Roche, 11175041910). The reac- 
tions were visualized with a 5-bromo-4-chloro-3-indolyl-phosphate/4-nitroblue 
tetrazolium (BCIP/NBT) substrate solution (Roche). 

Cell lines. HEK293 cells (RCB1637) and HEK293T cells (RCB2202) were obtained 
from the RIKEN BRC Cell Bank. Cells were cultured in DMEM (Wako) supple- 
mented with 10% FBS, 1% GlutaMAX (Thermo Fisher Scientific), and penicillin/ 
streptomycin at 37°C in a humidified atmosphere of 5% CO. Cell lines were 
regularly tested for mycoplasma contamination using MycoAlert (Lonza). Cell lines 
were regularly renewed by obtaining cell stocks from the Cell Bank for authen- 
tication. We used HEK293 and HEK293T cells because of their reliable growth, 
high efficiency in transfection and morphology suitable for electrophysiological 
experiments. 

Production of Sik3 gene-modified mice by conventional gene targeting. For 
generating Sik3°” knock-in mice, a genomic fragment containing exon 13 of the 
Sik3 gene was isolated from C57BL/6 mouse genomic BAC clone from a RP23 
mouse genomic BAC library (Advanced GenoTEchs Co). A 1.7-kb fragment of 
FRT-PGK-gb2-neo-FRT-loxP cassette (Gene Bridges) flanked by two flippase 
recognition target (FRT) sites was inserted before exon 12. The targeting vector also 
contains a G-to-A substitution at the fifth nucleotide from the beginning of intron 13. 
The targeting vector was linearized and electroporated into the C57BL/6N ES 
cell line RENKA. Correctly targeted clones were injected into eight-cell stage ICR 
mouse embryos, which were cultured to produce blastocysts and then transferred 
to pseudopregnant ICR females. Resulting male chimaeric mice were crossed with 
female C57BL/6N mice to establish the Sik3°?”"/+ line. To remove the neomycin 
resistance gene with the FLP-FRT system, Sik35?-"+ mice were crossed with 
Actb-FLP knock-in mice. 

Production of Sik3 gene-modified mice using ZFN. The custom-designed ZFN 
mRNAs targeting the exon 13-intron 13 boundary region of the Sik3 gene were 
obtained from Sigma-Aldrich’s Composers Custom ZEN service. Before the final 
assembly of the ZFN products, Sigma-Aldrich validated the designed ZFN binding 
sequences in silico using their bioinformatics tools and in vitro using Nero2A cell 
lines, ensuring high cutting efficiency and specificity using mismatch-specific endo- 
nuclease Cell according to the manufacturer’s instructions. The ZFN mRNAs were 
injected into single-cell stage C57BL/6J mouse zygotes at the University of Texas 
Southwestern Transgenic Core facility. The injected eggs were then transferred to 
pseudopregnant females to generate Fo founders. In total, 45 out of 96 Fy mice were 
found to be modified at the exon 13-intron 13 boundary region of the Sik3 gene. We 
crossed one Fy male mouse that had a 2-bp deletion from the last nucleotide of exon 
13 with female C57BL/6N mice to obtain F, mice of Sik3°"/+ ZEN. The F, mice 
were used to confirm the skipping of exon13 in Sik3 mRNA, which was purified 
from the brains and livers. The F, male mice were used for sleep/wakefulness 
behaviour analysis. 

Production of Flag-Sik3 mice and Nalcn?®" mice by CRISPR/Cas9 technology. 
To produce a Cas9/single-guide RNA (sgRNA) expression vector, oligonucleotide 
DNAs (5'-CACCGCGAGCGGCCATCGACCCGC-3’ and 5/-AAACGC 
GGGTCGATGGCCGCTCGC-3’) were annealed and then inserted into pX330 
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vector (Addgene). The cleavage activity of the pX330-Sik3Ex1 vector was evaluated 
by the EGxxFP system**. Genomic DNA containing exon 1 of the Sik3 gene was 
amplified and inserted into pCAG-EGxxFP to produce pCAG-EGxxFP-Sik3Ex1. 
The pX330-Sik3Ex1 and pCAG-EGxxFP-Sik3Ex1 were transfected into HEK293 
cells. As a donor oligonucleotide, a single-stranded 200-nucleotide DNA was syn- 
thesized (Integrated DNA Technologies), which contained a Flag-haemagglutinin- 
coding sequence in the centre and 70-nucleotide arms at the 5’ and 3 ends. Female 
C57BL/6] mice or Sik3°” knock-in mice were injected with pregnant mare serum 
gonadotropin and human chorionic gonadotropin at a 48-h interval, and mated 
with male C57BL/6J mice. The fertilized one-cell embryos were collected from 
the oviducts. Then, 5ng jl of pX330-Sik3Ex1 vector and 10ng yl! of the donor 
oligonucleotide were injected into the pronuclei of these one-cell-stage embryos. 
The injected one-cell embryos were then transferred into pseudopregnant ICR 
mice. Fy mice were genotyped for the presence of Flag-coding sequence in exon] of 
the Sik3 gene and for the presence of the Sik3°” mutation. Fy mice containing Flag- 
SIK3 were further examined for the presence of the Cas9 transgene and off-target 
effects. Candidate off-target sites were identified based on a complete match of 16 bp 
at the 3’ end, including the PAM sequence. Fy mice were mated with C57BL/6N 
mice to obtain F, offspring. 

Nalcn?" mice were produced as described above. To produce the sgsRNA 
expression vector, pX330-NalcnEx9, oligonucleotide DNAs (5'-CACCA 
GCAATAAACACATTCTGAA-3’ and 5’‘-AAACTTCAGAATGTGTTTA 
TTGCT-3’) were used. Genomic DNA containing exon 9 of the Nalcn gene was 
amplified and inserted into pCAG-EGxxFP to produce pCAG-EGxxFP-NalcnEx9. 
As a donor oligonucleotide, a single-stranded 199-nucleotide DNA containing a 
T-to-A substitution at the centre was synthesized (Integrated DNA Technologies). 
Nalcn mutant mice of N2-N; generation were used for sleep/wakefulness analysis. 
Phosphoproteomic analysis. To evaluate Flag-tagged SIK3 protein in brains, 
we performed peptide mapping of the purified Flag-SIK3 protein. The brains of 
Flag-Sik3 knock-in mice and Flag-Sik3°” knock-in mice were quickly dissected 
after cervical dislocation. Brains were homogenized in detergent-free buffer and 
then centrifuged (100,000, 30 min, 4°C). The supernatant was immunopre- 
cipitated with anti- DDDDK antibody beads (MBL 3325). The eluate was run 
on a polyacrylamide gel and stained with SilverQuest Silver staining kit (Life 
technologies). Flag—SIK3 band (150 kDa) was dissected with a fresh blade. The 
proteins in the bands were reduced with 10mM dithiothreitol and alkylated with 
40 mM iodoacetamide. Each sample was digested with trypsin (41g ml}; Trypsin 
Gold, Promega) at 37°C overnight. The extracted peptides were then separated 
via nano flow LC (Advance LC, Michrom Bioresources) using a C18 column. The 
LC eluate was coupled to a nano-ionspray source attached to a Orbitrap Velos Pro 
mass spectrometer (Thermo Fisher Scientific). All MS/MS spectra were searched 
using Proteome Discoverer 1.3 software (Thermo Fisher Scientific). Peptides were 
mapped through mouse SIK3 (NP_081774) with 56% coverage. 

To examine the effect of sleep deprivation on the phosphorylation status of 
SIK3 protein, five Flag-Sik3 knock-in mice or five Flag-Sik3°? knock-in mice were 
ad libitum slept (S) or sleep-deprived (SD) for 4 h by gentle handling immedi- 
ately after light onset (ZTO-ZT4). Five wild-type mice were used as a negative 
control. At ZT4, mouse brains were quickly dissected after cervical dislocation, 
rinsed with cold PBS, and snap frozen in liquid nitrogen. Each half of the brains 
was lysed in 2 ml of ice-cold lysis buffer (20 mM HEPES, pH 7.4, 150mM NaCl, 
1mM EDTA, 1% Triton X-100, 2mM MgCh, 15mM NaF, 10mM Na,P207) freshly 
supplemented with protease/phosphatase inhibitor cocktail tablets (Roche), and 
homogenized in a glass tissue homogenizer. After brain homogenate was incubated 
for 30 min and centrifuged at 13,000g for 20 min at 4°C, the supernatant was pre- 
cleared by IgG and Protein G beads for 30 min before immunoprecipitation. Each 
pre-cleared lysate was added to 5011 of anti-Flag antibody-conjugated Sepharose 
beads (Sigma, A2220) and rotated overnight at 4°C. After washing the beads five 
times with cold wash buffer (20 mM HEPES, pH 7.4, 150mM NaCl, 1mM EDTA, 
1% Triton X-100, 2mM MgCl, 15mM NaF, 10mM NagP2O7), 501 of elution 
buffer (2% SDS, 60 mM Tris-HCl, pH 6.8, 50mM DTT, 10% glycerol) was added 
and rotated for 10 min at 4°C. Elution was repeated twice and combined into 
one eluate and analysed by western blotting. For each group of Flag knock-in 
mice, the five eluates of were mixed and equally split into two or three samples for 
mass spectrometric analysis. Thus, a total of six (Flag-Sik3) or nine (Flag-Sik3 and 
Flag-Sik3°”) samples were reduced, alkylated, and trypsin digested overnight. After 
desalting, each sample was labelled with a different Tandem Mass Tag (TMT) 
reagent (Thermo Fisher Scientific), then all samples were combined into one 
mixture for HPLC fractionation using a C18 column. A total of 12 fractions were 
collected, and analysed separately on the Orbitrap-Fusion mass spectrometry 
platform (Thermo Fisher Scientific) using a reverse-phase liquid chromatography 
tandem mass spectrometry (LC-MS/MS) method. We performed data analysis to 
identify peptides and quantified reporter ion relative abundance using Proteome 


Discoverer 2.1 (Thermo Fisher Scientific). The relative abundance of quantified 
SIK3 phosphorylation sites was normalized with wild-type negative control and 
total SIK3 protein abundance. 

Patch-clamp recordings from HEK cells. To express wild-type NALCN, we used 
pTracer-CMV2-ratNALCN-EF1a-EGFP (a gift from D. Ren)”. A single nucleotide 
substitution was induced to make pTracer-CMV2-ratNALCN(DRL)-EFla-EGFP 
using a KOD plus Mutagenesis kit (Toyobo). HEK293T cells were grown to 
~50% confluency in 12-well plates. Using Lipofectamine LTX (211) and PLUS 
(Ll) reagents (Thermo Fisher Scientific), the cells were cotransfected with 0.3 1g 
of each plasmid DNA encoding rat NALCN-EGFP (wild type or DRL), mouse 
UNC-80, and mouse SRC (Y529F) (constitutively active Src) in 12-well plates. 
UNC-80 and SRC kinase activate NALCN?””®, In some experiments, the cells were 
incubated with 101M Gd*+ to inhibit NALCN. The cells were dissociated and 
plated on 18-mm coverslips coated with poly-t-lysine in fresh culture medium 
before patch-clamp recordings. 

All patch-clamp recordings from HEK293T cells were performed >72h after 
transfection. Recording patch pipettes were pulled from glass capillaries (1B150F-4, 
World Precision Instruments) using a micropipette puller (P-97, Sutter Instrument) 
to give a resistance of ~9 MO. The series resistance of whole-cell recordings was 
~40 MQ, which was not compensated. Patch pipettes were filled with solution 
containing 150mM CsOH, 120mM methanesulfonic acid, 10 mM NaCl, 10mM 
EGTA, 2mM Mg>ATP and 10mM HEPES (pH 7.4 adjusted with methanesulfonic 
acid; osmolarity, 290—299 mOsm 1-1 adjusted with CsCl). The cells on coverslips 
were transferred to a recording chamber under a fluorescence upright microscope 
(Axio Examiner D1, Zeiss) and continuously perfused with the bath solutions con- 
taining 150mM NaCl, 3.5mM KCl, 10mM HEPES, 20mM glucose, 5mM NaOH, 
2mM MgCl and 1.2mM CaCh (pH 7.4 adjusted; osmolarity, 300-310 mOsm 1). 
The transfected cells were identified by enhance green fluorescent protein (eGFP) 
fluorescence. Patch-clamp recordings were performed at room temperature (24°C) 
using a computer-controlled amplifier (MultiClamp 700B, Molecular Devices). The 
signals were digitized with A/D converter (Digidata 1440A, Molecular Devices), 
and acquired with Clampex (Molecular Devices) at a sampling rate of 50 kHz, 
and low-pass filtered at 5 kHz. At the end of recording, Gd** (101M) was used to 
confirm that the whole-cell currents were mediated through NALCN™. Data were 
analysed using Clampfit (Molecular Devices). The equilibrium potentials were 
calculated from I-V curves. Mean membrane conductance was estimated from 
the regression lines fitted to I- V curves from individual cells. Current, membrane 
conductance and charge transfer were normalized to membrane capacitance. 
Patch-clamp recordings from neurons. Patch pipettes and recording system 
were the same as those used in recordings from HEK293 cells. Acute brain slices 
containing the DpMe were prepared from post-natal day 12-23 Nalcn*'* or 
Nalcn?"!*+ mice. After the induction of deep anaesthesia with isoflurane, mice 
were decapitated and the brains were rapidly removed into an ice-cold cutting 
solution containing 2.5mM KCl, 1.25mM NaH2POg,, 26mM NaHCO;, 25mM 
glucose, 185mM sucrose, 0.5 mM CaCl, and 10mM MgCl (pH 7.4, when bubbled 
with 95% O» and 5% CO ). The brains were cut coronally into 200-250 j1m-thick 
slices with a vibratome (VT-12008S, Leica). The slices were incubated at 37°C for 
Lh in artificial cerebrospinal fluid (aCSF) containing 125 mM NaCl, 2.5mM KCl, 
1.25 mM NaH>PO,, 26mM NaHCO;, 10mM glucose, 2mM CaCl, and 1mM 
MgCh (pH 7.4, when bubbled with 95% O2 and 5% COz) before recordings. Slices 
were transferred to a recording chamber perfused with aCSF under an upright 
microscope (Axio Examiner D1, Zeiss). Patch pipettes were filled with solution 
containing 125mM K-gluconate, 10 mM KCl, 10 mM HEPES, 0.5mM EGTA, 
8mM phosphocreatine-Na2, 4mM ATP-Mg and 0.3 mM GTP-Na (pH 7.3 adjusted 
with KOH; osmolality, 290 mOsm I~!). The DpMe was identified with axon bun- 
dles. Recordings were made from cells located in the medial part of the DpMe. 
Cells showing no action potentials following current injection (>1nA, > 5ms) 
were discarded from analysis. Membrane potentials were recorded for 1-10 min. 
Fruitfly stocks and behavioural assay. Sik3 hypomorph and UAS-Sik3, UAS- 
Sik3(S563A) transgenic flies were gifts from M. Montminy and J. B. Thomas*”. 
elav-GS (GeneSwitch) stocks were from the Bloomington stock centre. Flies were 
reared at 25°C under 12-h light:12-h dark cycle in 50-60% relative humidity on 
a standard fly food consisting of corn meal, yeast, glucose, wheat germ and agar. 

Sleep analysis was performed as described previously*". In brief, male flies 
(2-5 days old) were individually housed in glass tubes (length, 65 mm; inside 
diameter, 3 mm) containing standard fly food at one end and a cotton plug on the 
other end. Sucrose-agar (1% agar supplemented with 5% sucrose) food was used for 
the GeneSwitch system assay, instead of standard food. The glass tubes were placed 
in the Drosophila activity monitor (DAM) (Trikinetics) and the locomotor activity 
of each fly was recorded as the number of infrared beam crossings in 1-min bin. 
Sleep was defined as periods of inactivity lasting 5 min or longer. Sleep assay were 
performed for 3 d under 12-h light:12-h dark cycle conditions and then constant 
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darkness conditions. For 12-h light:12-h dark cyles, zeitgeber time (ZT) was used, 
and for constant darkness, circadian time (CT), with CTO as 12h after lights-off 
of the last 12-h light:12-h dark conditions, was used to indicate the daily time. 

For conditional expression analysis, we used the GeneSwitch system’? where 

expression is induced by a steroid hormone antagonist RU486. Flies are moni- 
tored for 3 days in tubes without drug in constant darkness and then transferred 
to new tubes either with vehicle (0.5% DMSO) alone or with 0.5 mM RU486 
and then further monitored under constant darkness conditions. The expression 
of endogenous or transgenic Sik3 genes was confirmed by RT-PCR using RNA 
from fly heads. 
Nematode strains and quiescence assay. The wild-type strain N> and the mutant 
strain PY1479 kin-29(oy38) X were obtained from the Caenorhabditis Genetics 
Center (CGC)*?. All worms were maintained at 20°C on nematode growth 
medium (NGM) agar plates seeded with E. coli HB101. For construction of 
Pyp0::kin-29, kin-29 cDNA was amplified by RT-PCR and inserted into the plas- 
mid pPD-DEST (a gift from Y. lino) to generate pDEST-KIN-29. Next, we carried 
out the LR-recombinase reaction (Gateway System, Life Technologies) between 
pENTR-Py»0 (a gift from Y. Iino) and pDEST-KIN-29 to generate Pr29::kin-29. 
Proo::kin-29 was injected at 30 ng j1~! together with the injection marker 
Pinyo-3:mcherry (10ng ul?) and the empty vector pPD49_26 (60ng ul?) into the 
kin-29(oy38) mutant worms. 

Quiescence during the L4 to adult lethargus was measured using the 
microfluidic-chamber based assay“. In brief, polydimethylsiloxane-made micro- 
fluidic chambers containing liquid NGM and the E. coli HB101 were loaded with 
early L4 larvae and sealed with a cover glass plus 2% agarose, and set under the 
microscope. Images were taken every 2s for 12 to 20h at 20+ 0.5°C using the 
microscope M205FA (Leica) equipped with the camera MC120HD (Leica) (pixel 
size: 1,024|1m x 768 jtm) controlled by Leica Application Suite V4.3 or the micro- 
scope SZX16 (Olympus) equipped with the camera GR500BCM2 (Shodensha) 
(pixel size: 1,024|1m x 768 1m) controlled by js.Manager (UCSF). Subtraction 
between serial images was carried out using Image J, and worms were regarded as 
quiescent at a specific time point if the difference from the preceding time point 
was less than 1% of the total body size. The fraction of quiescence was defined as 
the number of quiescent time points divided by the total number of time points 
during a period of 10 min. 

The onset of lethargus quiescence was defined as the time point after which 
the fraction of quiescence was higher than 0.05 for at least 20 min, whereas the 
end point was defined as the time point after which the fraction of quiescence 
was lower than 0.05 for at least 20 min. Occasionally, brief episodes of quiescence 
were observed outside of lethargus both in wild-type and mutant worms; these 
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episodes were excluded by setting a threshold of 60 min for the minimum duration 
of lethargus quiescence. 

Statistics. Sample sizes were determined using R software based on averages and 
standard deviations that were obtained from small scale experiments. No method 
of randomization was used in any of the experiments. The experimenters were 
blinded to genotypes and treatment assignment. Statistical analysis was performed 
using SPSS Statistics 22 (IBM) and R software. All data were tested for Gaussian 
distribution and variance. Homogeneity of variances was tested with Levene’s test. 
We used Student's t-test for pairwise comparisons, one-way ANOVA for multiple 
comparisons, one-way repeated measure ANOVA for multiple comparisons with 
multiple data points, and two-way ANOVA for multiple comparisons involving two 
independent variables. ANOVA analyses were subjected to Tukey’s post-hoc test. 
When deviation from normality and lack of homogeneity of variances occurred 
(P<0.05), Mann-Whitney U test was used for group comparison. P< 0.05 was 
considered statistically significant. 

Data availability. The datasets generated and/or analysed during the current study 
are available from the corresponding author on reasonable request. 
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Extended Data Figure 1 | Sleep/wakefulness screening of randomly 
mutagenized mice. a, ENU-treated Gp mice were mated with B6N females 
to obtain the offspring. The F, mice were used for sleep/wakefulness 
analysis. A mouse showing any sleep abnormalities was crossed with 

B6N female mice. The N2 progeny was examined for heritability of sleep 
abnormality and for chromosomal mapping. b, B6J (1 = 20) and B6N 

(n= 21) showed similar total wake time (left, P= 0.67, two-tailed Student's 
t-test), NREMS time (centre, P= 0.66) and REMS time (right, P = 0.84). 
Data are mean +s.e.m. c, The histogram shows total daily wake time of 

all mice screened. Total wake time of screened mice was 735 + 66.9 min 
(mean + s.d.). Arrows indicate the founders of Sleepy mutant pedigrees. 
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Extended Data Figure 2 | QTL analysis of Sleepy mutant pedigrees and 
characterization of Sik3 transcript. a, QTL analysis of B021 (n= 119), B022 
(n=95), B024 (n=59) and B025 (n= 112) pedigrees for total wake time 
produced a single LOD score peak on chromosome 9. b, Direct sequencing of 
the exon 12/13 boundary and exon 13/14 boundary of Sik3 mRNA of Sik13*/* 
mouse. Direct sequencing of the short RT-PCR product specific to Sik3 mutant 
mice shows the direct transition from exon 12 to exon 14. c, d, Sik3 mRNA is 
expressed broadly in forebrain neurons (c). Sik3 mRNA is expressed throughout 
the cerebral cortex in the primary motor area (d). DG, dentate gyrus; LV, lateral 
ventricle; MHb, medial habenula. Scale bars, 1 mm (c) and 250 1m (d). e, 
RT-PCR of Sik3 mRNA from cerebral cortex and liver of Sik3*!*, Sik3°”/* and 
Sik35”’S mice. Normal Sik3 variant lacking exon 15 expressed in the cerebral 
cortex. 
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Extended Data Figure 3 | Sleep/wakefulness of Sik3°? knock-in mice. 

a, The structure of the Sik3 genome and targeting vector for Sik3°”, Neomycin 
resistance gene under the mouse phosphoglycerol kinase promoter (neo) 

was sandwiched with the flippase recognition target (FRT) sequences. The 
guanine at the fifth nucleotide from the beginning of the intron 13 was 
substituted with adenine. The neo cassette was deleted by crossing with 
Actb“¢*!P knock-in mice. b, RT-PCR of Sik3 mRNA of Sik3*?!* knock-in 
mice. c, Total wake time of Sik3°""’+ knock-in mice (n= 10) and Sik3*/+ 
littermates (n = 6). ***P < 0.001, two-way ANOVA followed by Tukey’s test. 
Data are mean + s.e.m. 
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Extended Data Figure 4 | Sleep/wakefulness behaviours of Sik3 mutant (n= 31) mice. *P < 0.05; **P < 0.01; ***P < 0.001, two-way ANOVA 


mice. a, Representative 8-s EEG and EMG for wake, NREMS and REMS followed by Tukey’s test (c-g). *P < 0.05 (red); *P < 0.001 (black), one- 
of Sik3 mutant mice. b, Representative hypnogram of Sik3 mutant mice. way repeated measures ANOVA followed by Tukey’s test (h). i, Total wake 
Wake (blue), NREMS (green) and REMS (red) are indicated from time of female Sik3*/+ (n= 10), Sik3°""+ (n= 11) and Sik 35!/SIp (n=9) 
ZT0 to ZT24. c-g, Total wake time (c), NREMS time (d), REMS time (e), mice. ***P < 0.001, two-way ANOVA followed by Tukey’s test. Data are 
NREMS/total sleep ratio (f), REMS/total sleep ratio (g) and circadian mean -+s.e.m. 


variation of REMS (h) of Sik3+/* (n= 22), Sik3°"?!+ (n =32) and Sik35?/SP 
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Extended Data Figure 5 | Characterization of sleep/wakefulness 
behaviours of Sik3 mutant mice. a, Wake time after cage change at 
ZT15 in Sik3*!* (n=5), Sik35”/+ (n= 10) and Sik3°?/5” (n=5) mice. 
The graph shows time spent in wakefulness from ZT15 to ZT16 under a 
basal condition and after cage change from the home cage to a new cage 
at ZT15. *P < 0.05; ***P < 0.001 versus Sik3+/*+; #P < 0.05; ###P < 0.001, 
one-way repeated measures ANOVA followed by Tukey’s test. b, Wake 
time increases for 3h after modafinil injection at ZTO in Sik3*/* (n=6), 
Sik35'* (n= 6) and Sik3°?/S? (n =6) mice. *P < 0.05; versus modafinil 
10mg kg ' in the same genotype, #P < 0.05, ##P < 0.01, two-way ANOVA 
followed by Tukey’s test. c, The circadian period under constant darkness 
in Sik3*!* (n= 8), Sik3°”'* (n= 8) and Sik35”’5” (n =6) mice. P=0.97, 
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one-way ANOVA. d, Total wake time of Sik3*!* (n= 9) and Sik3S!#/+ 

(n= 12) mice under constant darkness. ***P < 0.001, two-tailed Student's 
t-test. e, EEG power spectra of Sik3*!* (n= 22), Sik3°”!* (n= 32) and 
Sik35!?/S? (n = 31) mice. *P < 0.05; ***P < 0.001, one-way ANOVA 
followed by Tukey’s test. f, Increase in NREMS delta power after 2h, 4h 
and 6h of sleep deprivation of Sik3*/* (n= 11) and Sik35/* (n =11) mice 
relative to mean NREMS delta power during basal sleep. **P < 0.01, two- 
way ANOVA followed by Tukey’s test. g, Phosphorylation of Flag-SIK3 of 
Flag-Sik3*'* brains and of Flag-SIK3(SLP) of Flag-Sik35¥/+ brains with or 
without 4-h sleep deprivation. *P < 0.05; ***P < 0.001, two-way ANOVA 
followed by Tukey’s test. Data are mean +s.e.m. 
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Extended Data Figure 6 | Characterization of Flag-Sik3 mice made by 
CRISPR/Cas9 technology. a, Exon 1 of the Sik3 gene contains the first 
and second methionine residues. The single-guide RNA was designed to 
target the second methionine-coding region. The donor oligonucleotide 
has a Flag-haemagglutinin (HA)-coding sequence immediately after the 
second methionine and 70-nucleotide long arms at both 5’ and 3’ ends. 
The Flag-HA-coding region is followed by an Xbal site. b, Immunoblotting 
of brain homogenates of Sik3*!*, Sik3Pl9g/Flag Sik3F48SlP/+ mice showed 


that anti-Flag antibody detected Flag-SIK3 protein of Sik3/"“8/""8 brains 
and Flag-SIK3 (SLP) protein of Sik3*"85!?/* brains, whereas anti-SIK3 
antibody detected SIK3 proteins of all genotypes. c, RT-PCR of brain Sik3 
mRNA of Sik3*!*, Sik3®!4s/Flas, Sik3FlagS!P/+ mice. d, Tryptic peptides of 
immunoprecipitated and gel-purified Flag—SIK3 protein were analysed by 
LC-MS and mapped on the reference SIK3 protein. The peptide fragments 
were mapped on almost entire SIK3 protein with high confidence. 
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Extended Data Figure 7 | Phylogenetic conservation of the SIK3 protein. 
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Extended Data Figure 8 | Identification of Nalcn mutation of the 
Dreamless mutant pedigree. a, Histogram of REMS episode duration 

in N> littermates of Dreamless mutant pedigree (bars) and all F; mice 
examined (curve). b, Haplotype analysis of chromosome 14 of Dreamless 
mutant pedigree with or without short REMS episode duration. c, Whole- 
exome sequencing of Dreamless mutant N> mice. All mice with short 
REMS episode duration had the single nucleotide substitution in exon 9 of 
the Nalcn gene. 
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Extended Data Figure 9 | Sleep/wakefulness behaviour of Nalcn mutant 
mice. a, Representative 8-s EEG and EMG for wake, NREMS and REMS 
of Nalcn mutant mice b, Representative hypnogram of Nalcn*!* mice 
(top) and Nalcn?"’+ mice (bottom). Wake (blue), NREMS (green) and 
REMS (red) are indicated from ZTO to ZT 12. c, Enlarged hypnogram of 
around ZT7 showed the frequent transitions between NREMS and REMS 
of Nalcn?"*+ mice. d, Total wake time and NREMS time of Nalcn?"'+ mice 
(n=29) and Nalcn*'* mice (n= 25). Wake, P= 0.58; NREMS, P=0.17, 
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one-way ANOVA. e, f, Circadian period length (e) and amplitude of 
circadian behaviour (f) in constant darkness of Nalcn?”*+ mice (n= 6) and 
Nalcn*'* mice (n=7). P=0.76 (e); ***P < 0.001 (f), two-tailed Student’s 
t-test. g, Total REMS time of Nalcn?™'+ mice (n=9) and Nalcn*'* mice 

in constant darkness (n = 8). ***P < 0.001, two-tailed Student's t-test. 

h, EEG power spectra of Nalcn?"™* mice (n= 29) and Nalcn*'* mice 
(n=25). ***P < 0.001, one-way ANOVA followed by Tukey’s test. Data are 
mean +s.e.m. 
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Extended Data Figure 10 | Increased conductance of NALCN(DRL). 
a-c, Nalcn mRNA is expressed in the ventrolateral periaqueductal grey 
mater (vIPAG) and deep mesencephalic nucleus (DpMe) of the upper 
pons (a), the lateral dorsal tegmental nucleus (LDT) and sublateral dorsal 
nucleus (SLD) of the lower pons (b), and the lateral paragigantocellular 
nucleus (LPGi) of the medulla (c). AQ, aqueduct; dscp, decussation of 
superior cerebellar peduncle; IO, inferior olive; scp, superior cerebellar 
peduncle. Scale bars, 500 1m. d, Representative traces of membrane 
currents in response to ramp pulses (V;, =O mV, from —100mV to +100mV 
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in 1s; lower) recorded from HEK293T cells cotransfected with UNC80, 
SRC(Tyr529Phe), and NALCN-GFP (top) or NALCN(DRL)-GFP 
(middle). The traces are averaged from three trials. The transient 
capacitance currents are also recorded. e, Mean current density in response 
to ramp pulses (NALCN, n=5, black line; NALCN(DRL), n=7, purple 
line). The data from NALCN are also shown on an expanded scale (bottom 
right). f, The charge transfer of NALCN(DRL)-transfected cells was larger 
than that of NALCN(WT)-transfected cells. **P < 0.01, Mann-Whitney 

U test. The recording data are same as in e. Data are mean + s.e.m. 
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CRISPR/Cas9 8-globin gene targeting in 
human haematopoietic stem cells 


Daniel P. Dever!*, Rasmus O. Bak!*, Andreas Reinisch?, Joab Camarena!, Gabriel Washington!, Carmencita E. Nicolas!, 
Mara Pavel-Dinu!, Nivi Saxena', Alec B. Wilkens!, Sruthi Mantri!, Nobuko Uchida*+, Ayal Hendel!, Anupama Narla‘, 


Ravindra Majeti?, Kenneth I. Weinberg! & Matthew H. Porteus! 


The 6-haemoglobinopathies, such as sickle cell disease and 3-thalassaemia, are caused by mutations in the B-globin 
(HBB) gene and affect millions of people worldwide. Ex vivo gene correction in patient-derived haematopoietic stem 
cells followed by autologous transplantation could be used to cure 8-haemoglobinopathies. Here we present a CRISPR/ 
Cas9 gene-editing system that combines Cas9 ribonucleoproteins and adeno-associated viral vector delivery of a 
homologous donor to achieve homologous recombination at the HBB gene in haematopoietic stem cells. Notably, we 
devise an enrichment model to purify a population of haematopoietic stem and progenitor cells with more than 90% 
targeted integration. We also show efficient correction of the Glu6Val mutation responsible for sickle cell disease by using 
patient -derived stem and progenitor cells that, after differentiation into erythrocytes, express adult 8-globin (HbA) 
messenger RNA, which confirms intact transcriptional regulation of edited HBB alleles. Collectively, these preclinical 
studies outline a CRISPR-based methodology for targeting haematopoietic stem cells by homologous recombination at 
the HBB locus to advance the development of next-generation therapies for 3-haemoglobinopathies. 


Allogeneic haematopoietic stem-cell transplantation demonstrates 
that transplantation of haematopoietic stem cells (HSCs) with only 
a single wild-type HBB gene can cure the }-haemoglobinopathies. 
However, this transplantation technique is limited because of graft- 
versus-host disease and a lack of immunologically matched donors. An 
alternative to using allogeneic HSCs to cure the }-haemoglobinopathies 
is to use homologous recombination to modify the HBB gene directly 
in autologous HSCs!”. In 1985, Smithies and colleagues? were able 
to modify the human HBB gene by homologous recombination in 
a human embryonic carcinoma cell line, albeit at an extremely low 
frequency (10°). The subsequent discoveries that a site-specific 
DNA double-strand break (DSB) could stimulate homologous- 
recombination-mediated correction of a reporter gene and that 
engineered nucleases could be used to induce this DSB, formed the 
foundation of using homologous-recombination-mediated genome 
editing using engineered nucleases to modify the HBB gene directly*”. 
The ease of engineering as well as the robust activity of the CRISPR/ 
Cas9 RNA-guided endonuclease system makes it a promising tool 
to apply to the continuing challenge of developing effective and 
safe homologous-recombination-mediated genome editing to cure 
8-haemoglobinopathies®”. 

The CRISPR/Cas9 complex consists of the Cas9 endonuclease 
and a 100-nucleotide single-guide RNA (sgRNA). Target identifica- 
tion relies first on recognition of a 3-base-pair protospacer adjacent 
motif (PAM) and then on hybridization between a 20-nucleotide 
stretch of the sgRNA and the DNA target site, which triggers Cas9 
to cleave both DNA strands*. DSB formation activates two highly 
conserved repair mechanisms: canonical non-homologous end-joining 
(NHEJ) and homologous recombination’. Through iterative cycles of 
break and NHEJ repair, insertions and/or deletions (INDELs) can be 


created at the site of the break. By contrast, genome editing by homo- 
logous recombination requires the delivery of a DNA donor molecule 
to serve as a homologous template, which the cellular recombination 
machinery uses to repair the break by a ‘copy and paste’ method”. For 
gene-editing purposes, the homologous recombination pathway can be 
exploited to make precise nucleotide changes in the genome*. One of 
the key features of precise genome editing, in contrast to viral-vector- 
based gene transfer methods, is that endogenous promoters, regulatory 
elements, and enhancers can be preserved to mediate spatiotemporal 
gene expression’"'!~'3. The CRISPR/Cas9 system is highly effective 
at stimulating DSBs in primary human haematopoietic stem and 
progenitor cells (HSPCs) when the sgRNA is synthesized with chemical 
modifications, and then electroporated into cells'4, 

HSCs have the ability to repopulate an entire haematopoietic 
system!*, and several genetic'®'* and acquired!” diseases of the 
blood could potentially be cured by genome editing of HSCs. Recent 
studies have demonstrated efficient targeted integration in HSPCs by 
combining zinc-finger nuclease (ZFN) expression with exogenous 
homologous recombination donors delivered via single-stranded 
oligonucleotides”°, integrase-defective lentiviral vectors*!, or recom- 
binant adeno-associated viral vectors of serotype 6 (rAAV6)223, 
While showing very positive results in vitro, collectively, these studies 
also suggested that targeting HSCs by homologous recombination at 
disease-causing loci is difficult in clinically relevant HSPCs. 

In this study, we achieve efficient homologous-recombination- 
mediated editing frequencies at the HBB locus in CD34* HSPCs 
derived from mobilized peripheral blood using Cas9 ribonucleo- 
proteins combined with rAAV6 homologous donor delivery. In brief, 
we demonstrate: (1) Cas9- and rAAV6-mediated HBB targeting in 
HSCs characterized by the identification of modified human cells in 
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secondary transplants of immunodeficient mice; (2) efficient correction 
of the sickle cell disease (SCD)-causing Glu6Val mutation in HSPCs 
derived from several patients with SCD; and (3) development of a 
purification scheme using either fluorescence-activated cell sorting 
(FACS) or magnetic bead enrichment to create HSPC populations in 
which more than 85% of the cells have been modified by homologous- 
recombination-mediated targeted integration. This purification can 
be performed early in the manufacturing process when HSCs are still 
preserved, and may prove valuable in a clinical setting for removing 
untargeted HSPCs that will be in competition with homologous- 
recombination-edited HSPCs for engraftment and re-population after 
transplantation. 


CRISPR/Cas9-mediated HBB gene editing in HSPCs 

We confirmed high transduction of HSPCs using a self-complementary 
AAV6 (scAAV6) with an SFFV-GFP expression cassette”* (Extended 
Data Fig. 1a). HBB-specific single-stranded AAV6 (ssAAV6) vectors 
were then produced containing SFFV-GFP flanked by arms homolo- 
gous to HBB (Fig. 1a). To achieve gene editing at HBB, we used two 
different CRISPR platforms, which we have previously shown to be 


a b +k 
PAM 


Ws TeTTefefetet Tel [s[sTelelefalals] 


Miachinss [ee vs 
integration —= < e 
c hes d 
60 as 80 ‘ 
70 
ap le 
= 6 60 ere) 
3 & 
a 6 & 50 
> 0 400K 600K 800K n 40 
g o & F 
+ @ 2 
i 20 @ | RNP + AAV z 30 o. Pd 
@ 20 ao 
10 ‘. 
ere 
Ss o & 
Ss 


vr SSC 


Figure 1 | CRISPR/Cas9 and rAAV6-mediated targeted integration at 
the HBB locus in human CD34* HSPCs. a, Schematic of targeted genome 
editing at the HBB locus using CRISPR/Cas9 and rAAV6. Site-specific 
DSBs are created by Cas9 (scissors) mainly between nucleotides 17 and 18 
of the 20-bp target site, which is followed by the 5’-NGG-3’ PAM sequence 
(red). A DSB stimulates homologous recombination (HR) using the 
rAAV6 homologous donor as a repair template. White boxes, HBB exons; 
blue boxes, homology arms; orange boxes, SFFV-GFP-polyA expression 
cassette. b, HSPCs were electroporated with mRNA or RNP CRISPR 
system, and INDELs were analysed via TIDE software (n = number of data 
points within each group, all from different mobilized peripheral blood or 
cord blood donors; **P < 0.01, unpaired t-test. Bars represent means). 

c, HSPCs electroporated as above and transduced with HBB-specific 
rAAV6s were analysed by flow cytometry 18-21 days after electroporation 
when GFP levels were found to be constant. Left, percentage of GFPt 
HSPCs. Right, representative FACS plots (n = number of data points 
within each group, all from different donors; ***P < 0.001, unpaired t-test. 
Bars represent means). d, HSPCs were treated as above but targeted with 

a rAAV6 Glu6Val donor. Frequencies of allele types were quantified by 
sequencing of a total of 600 clones from TOPO-cloned in-out PCRs (n=6, 
all from different cord blood or bone marrow donors. Bars represent 
means). WT, wild type. 
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highly active in primary cells'*. Both platforms use sgRNAs chemi- 
cally modified at both termini with 2’-O-methyl-3’-phosphorothioate, 
and are delivered either in conjunction with Cas9 mRNA or as a 
ribonucleoprotein (RNP) complex. Both platforms yielded high 
INDEL frequencies when electroporated into HSPCs, with the RNP 
showing higher activity (Fig. 1b). By supplying ssAAV6 HBB donors 
after electroporation of Cas9 RNP, we achieved stable green fluorescent 
protein (GFP) expression in an average of 29% of HSPCs (Fig. 1c). 
We observed lower efficiencies using the mRNA platform (15%) (Fig. 1c). 
Cytotoxicity and off-target cleavage activity'* was significantly 
decreased using the RNP system (Extended Data Fig. 1b-d). 

Because AAV genomes can be captured at the site of an off-target 
DSB via NHEJ**”>”°, we performed experiments mismatching a 
nuclease with a non-homologous donor to see whether this occurs 
with our methodology. While approximately 20% of cells that received 
matched HBB nuclease and HBB donor maintained GFP expression 
after 18 days in culture, IL2RG nuclease combined with HBB donor 
resulted in 0.8% GFP* cells (Extended Data Fig. le-g), which was not 
significantly higher than the 0.5% GFP* cells observed when using the 
HBB AAV donor alone. These results demonstrate that capture of the 
HBB donor is an infrequent event using this system in human HSPCs. 
Furthermore, the observed GFP expression in 0.5% of HSPCs with the 
AAV donor alone suggests that random integration of rAAV6 is limited 
in human HSPCs. In fact, previous reports have shown AAV-mediated 
targeted integration without a DSB’””*, and thus, these GFP* cells may 
be the result of on-target events. 

SCD is caused by a single nucleotide mutation (A to T), which 
changes an amino acid (Glu6Val) at codon 6 of the HBB gene”’. 
We created a 4.5-kb rAAV6 donor template that would introduce 
the Glu6Val mutation along with six other silent single nucleotide 
polymorphisms (SNPs) to prevent Cas9 re-cutting of homologous 
recombination alleles (Extended Data Fig. 2a—c). Using this rAAV6 
donor with Cas9 RNP delivery, we measured an average allelic 
modification frequency of 19% in six different HSPC donors (Fig. 1d). 
These results confirm that the combined use of CRISPR with rAAV6 
can precisely change the nucleotide at the position of the mutation 
that causes SCD. 


Early enrichment of HBB- targeted HSPCs 

Because HSCs differentiate and progressively lose their long-term 
repopulating capacity after culturing, the identification of gene-edited 
HSPCs early in the manufacturing process would be of great use. In 
experiments using a GFP-expressing rAAV6 donor we observed that 
while HSPCs receiving only the rAAV6 donor expressed low levels of 
GFP, HSPCs that also received Cas9 RNP generated a population at 
day 4 after electroporation that expressed much higher levels of GFP 
(Fig. 2a, left). We proposed that this GFP"8 population was enriched 
for HBB-targeted cells. We therefore sorted and cultured the GFP™" 
population as well as the GFP!” and GFP~ populations. While sorted 
GFP!’ and GFP~ populations were, respectively, approximately 25% 
and 1% GFP* after 15-20 days in culture, the GFP population was 
more than 95% GFP*, suggesting that this population was indeed HBB- 
targeted (Fig. 2a, right). Linear regression showed that the percentage 
of GFP8h-expressing HSPCs at day 4 after electroporation strongly 
correlates with the total percentage of GFP* cells at day 18 (Extended 
Data Fig. 3). 

To confirm that the GFP™8" population was enriched for on-target 
integration, we used ‘in-out PCR (one primer binding the HBB locus 
outside the region of the homology arm and the other binding the 
integrated insert) to determine integration frequencies and allelic 
distribution in methylcellulose clones derived from the GFP"" popu- 
lation (95 clones). A total of 92% of clones had a targeted integration, 
with 38% containing biallelic integrations (Fig. 2b, Extended Data 
Fig. 4a—c and Supplementary Fig. 1a). This assay generates colonies 
from progenitor cells and the biallelic integration frequency could 
be different in HSCs. Nonetheless, these data show that the log-fold 
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Figure 2 | Enrichment of HBB-targeted HSPCs using FACS and 
magnetic bead-based technologies. a, Left, representative FACS plots 
highlight the GFP*8 population (red) generated by the addition of Cas9 
RNP. Right, HBB-targeted HSPCs from GFP" (red), GFP!°" (green) and 
GFP~ (blue) fractions were sorted and monitored for GFP expression. 
Error bars represent s.e.m. (1 = 11, all from unique mobilized peripheral 
blood or cord blood donors). b, PCR was performed on methylcellulose 
colonies from GFP™8" HSPCs to detect targeted integration at the 3’ end. 
c, Left, representative FACS plots highlight the tNGFR"8" population 
(red) generated by the addition of Cas9 RNP. Right, tNGFR8" (red) 
HSPCs were enriched using anti-CD271 (also known as LNGFR) magnetic 
microbeads and cultured for 18 days while monitoring tNGFR expression. 
Error bars represent s.e.m. (n=5, all from unique cord blood donors). 

d, PCR was performed on tNGF Rhish_derived methylcellulose colonies to 
detect targeted integrations at the 5’ end. 


transgene expression shift after rAAV6 and RNP delivery is due to 
homologous recombination at the intended locus, and that the shift 
allows FACS-based enrichment of HBB-targeted HSPCs. 

Although GFP is not a clinically relevant reporter gene, the trun- 
cated nerve growth factor receptor ({NGFR), in which the cytoplasmic 
intracellular signalling domain is removed, could be used to enrich 
for targeted HSPCs. tNGER is expressed on the cell surface, thereby 
allowing antibody-mediated detection of gene marking, and has already 
been used in human clinical trials****. We examined whether we could 
enrich HBB-targeted HSPCs using tNGFR magnetic-bead-based 
separation technology. HSPCs that received RNP and the rAAV6 donor 
(with a tNGER expression cassette) produced a tNGFR"* population 
that was not present in cells transduced with rAAV6 alone (Fig. 2c, 
left). We then enriched tNGFR"®*" cells using anti-NGFR magnetic 
microbeads and after 18 days in culture, on average 84% of HSPCs were 
tNGER* (Fig. 2c, right). We then performed in-out PCR on tNGFR™ 
methylcellulose clones to determine on-target integration frequencies, 
and found that 86% of clones had a targeted integration, with 30% 
having biallelic integrations (Fig. 2d, Extended Data Fig. 4d-f and 
Supplementary Fig. 1b). 

We next evaluated progenitor cell capacity of GFP'8" HSPCs using 
the colony-forming unit (CFU) assay. HBB GEPhish HSPCs formed 
all types of colonies (erythroid, granulocyte/macrophage, and multi- 
lineage) to a comparable extent as mock-electroporated cells (Extended 
Data Fig. 5a—c). We also evaluated frequencies of GFP" cells in 
subpopulations of HSPCs'>**-*° by immunophenotypic analysis, 
and observed a significant negative correlation between targeting 
frequencies and the immunophenotypic primitiveness of the analysed 
population (Extended Data Fig. 5d-f). To confirm these findings, we 
used another strategy to evaluate targeting rates in the primitive HSC 
population. HSPCs or HSCs were sorted from fresh cord blood, then 
subjected to homologous recombination experiments, and we observed 
a 38% reduction in targeting efficiencies in the HSCs compared to the 
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heterogeneous HSPC population (Extended Data Fig. 5g). We next 
tested whether inefficient targeting of primitive cells could be due to 
reduced rAAV6 donor availability. HSCs and multipotent progenitors 
were transduced with scAAV6-SFFV-GEP, and results showed a 
fivefold reduction in the number of GFP* HSCs and multipotent 
progenitors compared to the bulk CD34* population (Extended Data 
Fig. 5h). Collectively, our data suggest that although HSCs are more 
difficult to target than progenitor cells, we are achieving homologous 
recombination frequencies above 4% and usually above 10%. 


HBB targeting in long-term repopulating HSCs 
The current gold standard for HSC function, defined by the capacity to 
self-renew and form differentiated blood cells, is in vivo engraftment 
into immunodeficient non-obese diabetic (NOD)- severe combined 
immunodeficiency (SCID) Il2rg~/~ (NSG) mice. We used HSPCs 
derived from mobilized peripheral blood for such studies because of 
their high clinical relevance, although these cells have been shown 
to have reduced engraftment capacity in NSG mice compared to 
HSPCs derived from fetal liver, cord blood and bone marrow”*”. All 
transplanted mice displayed human engraftment in the bone marrow 
as measured by the presence of hCD45/HLA-ABC double-positive 
cells 16 weeks after transplant (Fig. 3a and Extended Data Fig. 6a, b). 
While we observed a decrease in human cell chimaerism for all treat- 
ment groups compared to the mock-treated group, all groups with 
nuclease-treated cells displayed similar chimaerism to the rAAV6- 
only group. We did measure a small, but not statistically significant, 
decrease for the RNP plus AAV GFP"8" sroup compared to RNP plus 
AAV, which can be explained by transplantation of fewer total cells 
and fewer phenotypically identified long-term HSCs (Extended Data 
Fig. 6c). There was a significant decrease from RNP plus AAV input 
targeting frequencies (16% in the CD34+ mobilized peripheral blood 
HSPCs) compared to the percentage of GFP* cells in the bone marrow 
at week 16 after transplantation (3.5%) (Fig. 3c). This decrease is 
consistent with previous publications, and immunophenotyping of 
input cells did in fact show an average of 4% targeting in the CD34* 
CD38~ CD90+ CD45RA~ population (Extended Data Fig. 5f). 
Despite these observed reductions in vivo, our median rates of HBB 
gene targeting in human cells in the bone marrow (3.5%) seem to be 
higher than what was found by others, particularly Hoban et al. and 
Genovese et al., in which most mice appeared to have less than 1% gene 
modification after transplant using ZFNs targeting HBB and IL2RG, 
respectively”?!. By contrast, mice transplanted with RNP plus GFPhis 
cells had a median of 90% GFP* human cells at week 16 after transplant, 
with three mice showing more than 97% GFP* human cells (Fig. 3a, c). 
We also observed both myeloid (CD33*) and lymphoid (CD19*) 
reconstitution with a median of 94% and 83% GFP* cells, respectively 
(Fig. 3c), implicating targeting of multipotent HSCs. We detected 5% 
and 49% GFP* human HSPCs (CD34+ CD10_) in the bone marrow 
of mice transplanted with RNP plus AAV and RNP plus AAV GFPhis* 
cells, respectively (Extended Data Fig. 7). Multi-parameter flow cyto- 
metric analysis showed no perturbations in lineage reconstitution and 
no evidence of abnormal haematopoiesis, a functional assessment 
of the safety of the editing procedure. To determine experimentally 
whether we targeted HBB in HSCs, we performed secondary trans- 
plants for the RNP plus AAV and RNP plus AAV GEP"8" groups. For 
both groups, we detected human cells in the bone marrow of secondary 
recipients at weeks 12-14 after transplant, with 7% and 90% GFP* cells 
for the RNP plus AAV and RNP plus AAV GEP"8" sroups, respectively 
(Fig. 3d). More importantly, we confirmed HBB on-target integration 
events in human cells sorted from the bone marrow of secondary 
recipients from RNP plus AAV GFP" 8" experimental groups (Fig. 3e 
and Supplementary Fig. 2). Collectively, these data confirm that our 
strategy targets the HBB gene in human HSCs. 

We next scaled up the genome-editing process to resemble a more 
clinically relevant HSPC starting cell number. We electroporated 
80 million mobilized peripheral-blood-derived HSPCs with the HBB 
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Figure 3 | HBB gene-targeted HSPCs display long-term and multi- 
lineage reconstitution in NSG mice. a, 16 weeks after transplantation, 
mouse bone marrow was analysed for human cell chimaerism and GFP 
expression by flow cytometry. Top, representative FACS plot from a mouse 
transplanted with RNP plus AAV GFP" HSPCs showing engrafted 
human cells in the red gate. Bottom, representative FACS plot showing 
GFP-expressing human cells (red). CD19* B cells and CD33* myeloid 
cells are back-gated and shown in blue and green, respectively. b, Human 
engraftment in NSG mice from all experimental groups. Three different 
HSPC donors were used for engraftment studies (1 = number of data 
points within group). ****P < 0.0001, NS (not significant) = P > 0.05, 
one-way analysis of variance (ANOVA) and Tukey’s multiple comparison 
test. Bars represent median. c, Percentage GFP* cells in the total human 
population (red), CD19* B cells (blue), and CD33* myeloid cells (green) 
(n= number of data points within group). *P < 0.05, ****P < 0.0001, 
one-way ANOVA and Tukey’s multiple comparison test for total human 
cells, and unpaired t-test with Welch’s correction for B and myeloid cells. 
Bars represent median. d, Flow cytometric analysis of cell chimaerism 


RNP system, transduced them with either SFFV-GFP or SFFV-tNGFR 
rAAV6, and then transplanted bulk RNP plus AAV, sorted GFphigh 
and tNGFR"8" cells (enriched by FACS or magnetic microbeads). At 
16 weeks after transplant, all mice displayed engraftment of edited 
human cells in the bone marrow (Fig. 3f). In this large-scale procedure, 
the human cell engraftment of treated cells was equivalent to ‘mock in 
our previous experiment (Fig. 3b). Although we observed reductions 
when comparing editing rates in the input cells to engrafted cells 
in vivo, the HBB-tNGFR mice showed a lower reduction (12% in vitro 
versus 7.5% in vivo) than the HBB-GFP mice (10% in vitro versus 1.9% 
in vivo), suggesting tNGFR might be a better transgene to evaluate 
editing of HSCs in vivo (Fig. 3g). Furthermore, mice transplanted with 
enriched targeted HSPCs displayed human cell-editing frequencies of 
10-75% (three mice), with human engraftment levels ranging from 
4% to 30% (Fig. 3f, g). These data suggest that our methodology can 
be translated to perform large-scale genome editing in HSCs at the 
HBB locus. 


HBB gene correction in SCD HSPCs 

We next tested our methodology to correct the disease-causing Glu6Val 
mutation in CD34* HSPCs derived from patients with SCD. We first 
confirmed high frequencies of INDELs (Fig. 4a) and homologous 
recombination using an SFFV-GFP donor (Fig. 4b) at the HBB locus in 
SCD HSPCs. We then produced a therapeutic rAAV6 donor (corrective 
SNP donor) designed to revert the Glu6Val mutation, while also intro- 
ducing silent mutations to prevent Cas9 re-cutting and premature cross 
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and GFP expression 12-14 weeks after secondary transplantation. Left, 
representative FACS plot from a secondary mouse transplanted with 

RNP plus AAV (top) or RNP plus AAV GFP (bottom) cells showing 
engrafted human cells in the red gate. Right, percentage GFP* human 
cells in the bone marrow of secondary recipients. e, Gel images of in-out 
PCR analyses on sorted human cells from secondary recipients to analyse 
on-target integrations at the 5’ and 3’ ends. Input control (ctrl) PCR 

was performed on the human CCRS gene. Positive control is an HSPC 
sample targeted at HBB with SFFV-GFP-polyA. f, 80 million mobilized 
peripheral-blood-derived CD34* cells were electroporated with HBB RNP 
and transduced with HBB AAV6s. Bulk HSPCs or HSPCs enriched for 
targeting (by FACS or bead enrichment) were transplanted into the tail 
vein of sublethally irradiated mice. Then, 16 weeks after transplant, human 
cell chimaerism was analysed by flow cytometry (n = number of data 
points within group). g, Percentage GFP* and tNGFR? cells in the human 
population was analysed by flow cytometry (m= number of data points 
within group), bars represent median. 


over (Extended Data Fig. 8a). Targeting SCD HSPCs with the corrective 
SNP donor reverted an average of 50% of the Glu6Val (HbS) alleles 
to wild-type (HbA) alleles (Fig. 4c), and analysis of methylcellulose 
clones showed that an average of 45% of clones had at least one HbA 
allele (Extended Data Fig. 8b). We next created an anti-sickling HBB 
cDNA therapeutic donor (HbAS3; ref. 38) using previously reported 
strategies'*'? of knocking in divergent cDNAs into the gene start 
codon to preserve endogenous promoter/enhancer function, followed 
by a clinically relevant promoter (EF1«) driving tNGFR expression to 
allow for enrichment of edited cells (Extended Data Fig. 9a). Using 
this donor, we targeted an average of 11% of SCD patient-derived 
HSPCs (Fig. 4d) and confirmed seamless integration by in-out PCR 
(Extended Data Fig. 9b). Notably, we observed a tNGFR"™* population 
as described previously (Fig. 2c), indicating the ability to enrich SCD- 
corrected HSPCs early in the culture process. We conclude that our 
methodology can correct the Glu6Val mutation using two different 
donor designs. 

We next tested whether the HBB-edited SCD HSPCs maintained 
their erythroid differentiation potential by subjecting tNGFR™2" and 
mock HSPCs to a 21-day in vitro erythroid differentiation protocol??””. 
Flow cytometric analyses after erythroid differentiation showed a 
high proportion of GPA* CD45~ CD71* CD34° cells, indicating 
the presence of mature differentiated erythrocytes that express 
haemoglobin"! (Fig. 4e and Extended Data Fig. 10). To confirm that 
adult 3-globin (HbA) or HbAS3 mRNA was transcribed from edited 
HBB alleles, we performed reverse transcription quantitative PCR 
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Figure 4 | Correction of the Glu6Val mutation in SCD patient-derived 
HSPCs. a, Genomic DNA from HBB RNP-treated SCD HSPCs was 
collected and INDELs were analysed via TIDE software (n = 4 different 
SCD patient donors). b, HBB-targeted SCD HSPCs were analysed for GFP 
expression by flow cytometry (n = 4 different SCD patient donors). c, SCD 
HSPCs were targeted with the rAAV6 corrective SNP donor. HBB allele 
types were analysed by sequencing TOPO-cloned PCR fragments derived 
from in-out PCR. Around 50-100 TOPO clones were analysed from each 
of three different HSPC donors (n = 3 different SCD patient donors). 

d, SCD HSPCs were targeted with the anti-sickling HBB cDNA-EFla 


(RT-qPCR) on erythrocytes differentiated from edited SCD HSPCs. 
Erythrocytes edited with the corrective SNP donor expressed 56% 
HbA mRNA out of total 3-globin mRNA, whereas erythrocytes edited 
with the cDNA donor (bulk) expressed 20% HbAS3 mRNA (Fig. 4f). 
Thus the percentage of HbAS3 mRNA (20%) matched or exceeded the 
percentage of cells modified by the tNGER cassette (11%), suggesting 
functional expression of the AS3 cDNA from the endogenous HBB pro- 
moter. Notably, erythrocytes differentiated from enriched tNGFR™" 
HSPCs expressed 70% HbAS3 mRNA, confirming an enrichment of 
functionally corrected HSPCs. 


Discussion 

While our data support the idea that HSCs are more resistant to 
homologous recombination-mediated editing, they also show that it 
is possible to edit these cells at reasonable frequencies”!. Enrichment 
of targeted cells resulted in the removal of most HSCs, leading to 
an overall eightfold decrease in the total number of HSCs in the 
transplanted enriched population (Extended Data Fig. 6c). Even though 
we transplanted eightfold fewer total HSCs in the enriched population 
than the non-enriched population (GFP"£" versus RNP plus AAV), we 
found that absolute numbers of edited cells in the bone marrow of the 
mice from the enriched group was on average fivefold higher 16 weeks 
after transplant (Extended Data Fig. 6c). Thus, our enrichment strategy 
not only yields higher frequencies of modified cells in the transplanted 
mice, but also the absolute number of modified human cells in the mice 
was higher and can thus ameliorate the problem of inefficient HSC 
targeting. Recent advances in ex vivo HSC expansion protocols and 
identification of small molecule drugs, such as UM171 (ref. 42) that 
expands HSCs, might be combined with our strategy to generate a large 
and highly enriched population of genome-edited HSCs. Future studies 
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-tNGER correction donor. Frequencies of tNGFR* cells were analysed 
by flow cytometry (n= 3 different SCD patient donors). e, SCD 

mock HSPCs and sorted SCD tNGFR"" HSPCs were differentiated 

into erythrocytes in vitro. Representative FACS plots from day 21 of 
differentiation show cell surface markers associated with erythrocytes 
(GPA* CD45~ CD71* CD34-). f, HbS, HbA and HbAS3 mRNA 
expression was quantified by RT-qPCR in erythrocytes differentiated 
from HBB-edited or mock SCD HSPCs. All mRNA transcript levels were 
normalized to the RPLPO input control (n = 2-3 different SCD patient 
donors). All bars represent means. 


will help to determine whether HSC expansion would be required for 
clinical translation of our enrichment model. 

Our methodology sets the framework for CRISPR-mediated 
HBB targeting in HSCs that has the power to be translated to the 
clinic. Although GFP is an unsuitable marker for gene therapy, our 
enrichment protocol using tNGFR (Figs 2c, 3f, g) (or other similar 
signalling-inert cell surface markers) represents a strategy for the next 
generation of 8-haemoglobinopathy therapies that are based on gene 
editing. These studies show that this methodology can enrich corrected 
SCD patient-derived HSPCs that can differentiate into erythrocytes 
that express HBB anti-sickling mRNA from the endogenous HBB 
promoter. This tNGFR selection strategy has the potential advantage 
over chemoselection strategies because it avoids exposing edited cells 
and patients to potentially toxic chemotherapy’*. The strategy of 
knocking in a HBB cDNA along with a selectable marker to enrich for 
modified cells would be applicable to both SCD and almost all forms 
of 8-thalassaemia. Furthermore, because we can efficiently scale up the 
genome-editing process to clinically relevant starting numbers, future 
studies will address the development of a current good manufacturing 
practice (CGMP)-compatible process for editing the HBB locus in 
HSPCs. 

In conclusion, we believe that the presented methodology lays the 
foundation for CRISPR/Cas9-mediated genome editing therapies 
not only for the 8-haemoglobinopathies, but also for a range of other 
haematological diseases, and generally advances HSC-based cell and 
gene therapies. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

AAV vector production. AAV vector plasmids were cloned in the pAAV-MCS 
plasmid (Agilent Technologies) containing inverted terminal repeats from AAV 
serotype 2. The HBB rAAV6 GFP and tNGFR donor contained promoter, MaxGFP 
or tNGER, and BGH polyA. The left and right homology arms for the GFP and 
tNGEFR HBB donors were 540 bp and 420 bp, respectively. The Glu6Val rAAV6 
donor contained 2.2 kb of sequence homologous to the sequence upstream of 
Glu6Val. The nucleotide changes are depicted in Extended Data Fig. 2. Immediately 
downstream of the last nucleotide change was 2.2 kb of homologous HBB sequence. 
HBB cDNA contained same homology arms as GFP and tNGFR donors above 
except the left homology arm was shortened to end at the sickle mutation. Sequence 
of full HBB cDNA is depicted in (Extended Data Fig. 9b). The sickle corrective 
donor used in the SCD-derived HSPCs in Fig. 4 had a total of 2.4 kb sequence 
homology to HBB with the SNPs shown in Extended Data Fig. 8a in the centre. 
scAAV6 carrying the SFFV promoter driving GFP was provided by H.-P. Kiem. 
AAV6 vectors were produced as described with a few modifications”. In brief, 
293FT cells (Life Technologies) were seeded at 13 x 10° cells per dish in ten 15-cm 
dishes one day before transfection. One 15-cm dish was transfected using standard 
PEI transfection with 6\1g ITR-containing plasmid and 221g pDGM6 (a gift from 
D. Russell), which contains the AAV6 cap genes, AAV2 rep genes, and adenovirus 
helper genes. Cells were incubated for 72h until collection of AAV6 from cells by 
three freeze-thaw cycles followed by a 45 min incubation with TurboNuclease at 
250U ml! (Abnova). AAV vectors were purified on an iodixanol density gradient 
by ultracentrifugation at 237,000g for 2h at 18°C. AAV vectors were extracted at 
the 60-40% iodixanol interface and dialysed three times in PBS with 5% sorbitol in 
the last dialysis using a 1OK MWCO Slide-A-Lyzer G2 Dialysis Cassette (Thermo 
Fisher Scientific). Vectors were added pluronic acid to a final concentration of 
0.001%, aliquoted, and stored at —80°C until use. AAV6 vectors were titred using 
quantitative PCR to measure number of vector genomes as described previously“. 
CD34* haematopoietic stem and progenitor cells. Frozen CD34* HSPCs derived 
from bone marrow or mobilized peripheral blood were purchased from AllCells 
and thawed according to manufacturer's instructions. CD34* HSPCs from cord 
blood were either purchased frozen from AllCells or acquired from donors under 
informed consent via the Binns Program for Cord Blood Research at Stanford 
University and used fresh without freezing. CD34* HSPCs from patients with 
SCD were purified within 24h of the scheduled apheresis. For volume reduction 
via induced rouleaux formation, whole blood was added 6% Hetastarch in 0.9% 
sodium chloride injection (Hospira, Inc.) in a proportion of 5:1 (v/v). Following 
a 60-90-min incubation at room temperature, the top layer, enriched for HSPCs 
and mature leukocytes, was carefully isolated with minimal disruption of the 
underlying fraction. Cells were pelleted, combined, and resuspended in a volume 
of PBS with 2mM EDTA and 0.5% BGS directly proportional to the fraction of 
residual erythrocytes—typically 200-400 ml. Mononuclear cells (MNCs) were 
obtained by density gradient separation using Ficoll and CD34* HSPCs were 
purified using the CD34* Microbead Kit Ultrapure (Miltenyi Biotec) according to 
manufacturer's protocol. Cells were cultured overnight and then stained for CD34 
and CD45 using APC anti-human CD34 (clone 561; Biolegend) and BD Horizon 
V450 anti-human CD45 (clone HI30; BD Biosciences), and a pure population of 
HSPCs defined as CD34°"#"'/CD45“™ were obtained by cell sorting on a FACS 
Aria II cell sorter (BD Biosciences). All CD34* HSPCs were cultured in StemSpan 
SFEM II (StemCell Technologies) supplemented with SCF (100 ng ml~}), TPO 
(100 ng ml~?), Flt3 ligand (100 ng ml“), IL-6 (100ng ml“), and StemRegenin1 
(0.75mM). Cells were cultured at 37 °C, 5% CO and 5% O>. 

Electroporation and transduction of cells. The HBB and IL2RG synthetic 
sgRNAs used were purchased from TriLink BioTechnologies with chemi- 
cally modified nucleotides at the three terminal positions at both the 5’ and 
3’ ends. Modified nucleotides contained 2’-O-methyl-3’-phosphorothioate 
and the sgRNAs were HPLC-purified. The genomic sgRNA target sequences, 
with PAM in bold, are: HBB: 5’-CTTGCCCCACAGGGCAGTAACGG-3’ 
(refs 45, 46); IL2RG: 5‘-TGGTAATGATGGCTTCAACATGG-3’. Cas9 mRNA 
containing 5-methylcytidine and pseudouridine was purchased from TriLink 
BioTechnologies. Cas9 protein was purchased from Life Technologies. Cas? RNP 
was made by incubating protein with sgRNA at a molar ratio of 1:2.5 at 25°C for 
10 min immediately before electroporation. CD34+ HSPCs were electroporated 
1-2 days after thawing or isolation. CD34* HSPCs were electroporated using the 
Lonza Nucleofector 2b (program U-014) and the Human T Cell Nucleofection 
Kit (VPA-1002, Lonza) as we have found this combination to be superior in 
optimization studies. The following conditions were used: 5 x 10° cells ml~!, 
300 ,1g ml~! Cas9 protein complexed with sgRNA at 1:2.5 molar ratio, or 
100j1g ml synthetic chemically modified sgRNA with 150,.g ml’ Cas9 mRNA 
(TriLink BioTechnologies, non-HPLC purified). Following electroporation, cells 
were incubated for 15 min at 37°C after which they were added AAV6 donor 


vectors at an MOI (vector genomes/cell) of 50,000-100,000 and then incubated 
at 30°C or 37°C overnight (if incubated at 30°C, plates were then transferred to 
37°C) or targeting experiments of freshly sorted HSCs (Extended Data Fig. 5g), 
cells were electroporated using the Lonza Nucleofector 4D (program EO-100) and 
the P3 Primary Cell Nucleofection Kit (V4XP-3024). For the electroporation of 
80 million CD34* HSPCs, the Lonza 4D-Nucleofector LV unit (program DZ-100) 
and P3 Primary Cell Kit were used. Subsequently, we have found no benefit to the 
30°C incubation and now perform all of our manufacturing at 37°C. 
Measuring targeted integration of fluorescent and tNGFR donors. Rates of 
targeted integration of GFP and tNGER donors were measured by flow cytometry 
at least 18 days after electroporation. Targeted integration of a tNGFR expression 
cassette was measured by flow cytometry of cells stained with APC-conjugated 
anti-human CD271 (NGER) antibody (BioLegend, clone: ME20.4). For sorting 
of GFP" 8" or tNGFR"™8" populations, cells were sorted on a FACS Aria II SORP 
using the LIVE/DEAD Fixable Blue Dead Cell Stain Kit (Life Technologies) to 
discriminate live and dead cells according to manufacturer's instructions. 
Positive selection and enrichment of tNGFR* HSPCs. Positive selection of 
targeted HSPCs was performed using the CD271 (tNGFR) Microbead Kit (Miltenyi 
Biotech), according to the manufacturer’s instructions 72h after electroporation. 
In brief, tNGFR* cells were magnetically labelled with CD271 Microbeads after 
which the cell suspension was loaded onto an equilibrated MACS column inserted 
in the magnetic field of a MACS separator. The columns were washed three times, 
and enriched cells were eluted by removing the column from the magnetic field and 
eluting with PBS. Enrichment was determined by flow cytometry during culture 
for 2-3 weeks by FACS analysis every 3 days. 

Immunophenotyping of targeted HSPCs. Collected wells were stained with 
LIVE/DEAD Fixable Blue Dead Cell Stain (Life Technologies) and then with 
anti-human CD34 PE-Cy7 (581, BioLegend), CD38 Alexa Fluor 647 (AT1, Santa 
Cruz Biotechnologies), CD45RA BV 421 (HI100, BD Biosciences), and CD90 
BV605 (5E10, BioLegend) and analysed by flow cytometry. For sorting of CD34 
or CD34* CD38~ CD90° cells, cord-blood-derived CD34* HSPCs were stained 
directly after isolation from blood with anti-human CD34 FITC (8G12, BD 
Biosciences), CD90 PE (5E10, BD Biosciences), CD38 APC (HIT2, BD Bioscience), 
and cells were sorted on a FACS Aria II (BD Bioscience), cultured overnight, and 
then electroporated with HBB RNP and transduced with HBB GFP rAAV6 using 
our optimized parameters. 

Measuring targeted integration of the E6V donor. For assessing the allele 
modification frequencies in samples with targeted integration of the Glu6Val 
rAAV6 donor, PCR amplicons spanning the targeted region (see Extended 
Data Fig. 2a) were created using one primer outside the donor homology arm 
and one inside: HBB_outside 5/-GGTGACAATTTCTGCCAATCAGG-3’ 
and HBB_inside: 5’-GAATGGTAGCTGGATTGTAGCTGC-3’. The PCR 
product was gel-purified and re-amplified using a nested primer set (HBB_ 
nested_fw: 5’-GAAGATATGCT TAGAACCGAGG-3’ and HBB_nested_rv: 
5’-CCACATGCCCAGTTTCTATTGG-3’) to create a 685-bp PCR amplicon 
(see Extended Data Fig. 2a) that was gel-purified and cloned into a TOPO plasmid 
using the Zero Blunt TOPO PCR Cloning Kit (Life Technologies) according to the 
manufacturer's protocol. TOPO reactions were transformed into XL-1 Blue com- 
petent cells, plated on kanamycin-containing agar plates, and single colonies were 
sequenced by McLab by rolling circle amplification followed by sequencing using 
the following primer: 5’/-GAAGATATGCTTAGAACCGAGG-3’. For each of the 
six unique CD34* donors used in this experiment, 100 colonies were sequenced. 
Additionally, 100 colonies derived from an AAV-only sample were sequenced and 
detected no integration events. 

Measuring INDEL frequencies. INDEL frequencies were quantified using the 
TIDE software’’ (tracking of indels by decomposition) and sequenced PCR 
products obtained by PCR of genomic DNA extracted at least 4 days after electro- 
poration as previously described“. 

Methylcellulose CFU assay. The CFU assay was performed by FACS sorting 
of single cells into 96-well plates containing MethoCult Optimum (StemCell 
Technologies) 4 days after electroporation and transduction. After 12-16 days, 
colonies were counted and scored based on their morphological appearance in 
a blinded fashion. 

Genotyping of methylcellulose colonies. DNA was extracted from colonies 
formed in methylcellulose from FACS sorting of single cells into 96-well plates. 
In brief, PBS was added to wells with colonies, and the contents were mixed and 
transferred to a U-bottomed 96-well plate. Cells were pelleted by centrifugation 
at 300g for 5 min followed by a wash with PBS. Finally, cells were resuspended 
in 25 jl QuickExtract DNA Extraction Solution (Epicentre) and transferred to 
PCR plates, which were incubated at 65°C for 10 min followed by 100°C for 
2 min. Integrated or non-integrated alleles were detected by PCR. For detecting 
HBB GFP integrations at the 3’ end, two different PCRs were set up to detect 
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integrated (one primer in insert and one primer outside right homology arm) 
and non-integrated alleles (primer in each homology arm), respectively (see 
Extended Data Fig. 4a). HBB_int_fw: 5/-GTACCAGCACGCCTTCAAGACC-3’, 
HBB_int_rv: 5‘-GATCCTGAGACTTCCACACTGATGC-3’, HBB_no_ 
int_fw: 5'-GAAGATATGCTTAGAACCGAGG-3’, HBB_no_int_rv: 
5!-CCACATGCCCAGTTTCTATTGG-3’. For detecting HBB tNGFR integra- 
tions at the 5’ end, a 3-primer PCR methodology was used to detect the inte- 
grated and non-integrated allele simultaneously (see Extended Data Fig. 4d). 
HBB_outside_5’Arm_fw: 5‘-GAAGATATGCTTAGAACCGAGG-3’, 
SFFV_rev: 5‘-ACCGCAGATATCCTGTTTGG-3’, HBB_inside_3’Arm_rev: 
5'-CCACATGCCCAGTTTCTATTGG-3’. Note that for the primers assessing non- 
integrated alleles, the Cas9 cut site is at least 90 bp away from the primer-binding 
sites and since CRISPR/Cas9 generally introduces INDELs of small sizes, the 
primer-binding sites should only very rarely be disrupted by an INDEL. 

Transplantation of CD34* HSPCs into NSG mice. For in vivo studies, 6 to 8 
week-old NSG mice were purchased from the Jackson laboratory (Bar Harbour). 
The experimental protocol was approved by Stanford University’s Administrative 
Panel on Laboratory Animal Care. For transplant data in Fig. 3a—c, sample sizes 
were not chosen to ensure adequate power to detect a pre-specified effect size. Four 
days after electroporation/transduction or directly after sorting, 500,000 cells (or 
100,000-500,000 cells for the GFP" group) were administered by tail-vein injec- 
tion into the mice after sub-lethal irradiation (200 cGy) using an insulin syringe with 
a 27 gauge x 0.5 inch (12.7 mm) needle. For transplant data in Fig. 3f, g, three days 
after electroporation, 400,000-700,000 bulk HSPCs or HSPCs enriched for target- 
ing (FACS or bead-enrichment) were transplanted as described above. Mice were 
randomly assigned to each experimental group and evaluated in a blinded fashion. 
For secondary transplants, human cells from the RNP plus AAV group were pooled 
and CD34* cells were selected using a CD34 bead enrichment kit (MACS CD34 
MicroBead Kit UltraPure, human, Miltenyi Biotec), and finally cells were injected 
into the femurs of female secondary recipients (3 mice total). Because GEP'8" mice 
had low engraftment, they were not CD34*-selected, but total mononuclear cells 
were filtered, pooled, and finally injected into the femur of two secondary recipients. 
Assessment of human engraftment. At week 16 after transplantation, mice were 
euthanized, mouse bones (2x femur, 2x tibia, 2x humerus, sternum, 2x pelvis, 
spine) were collected and crushed using mortar and pestle. MNCs were enriched 
using Ficoll gradient centrifugation (Ficoll-Paque Plus, GE Healthcare) for 25 min 
at 2,000g, room temperature. Cells were blocked for nonspecific antibody binding 
(10% v/v, TruStain FcX, BioLegend) and stained (30 min, 4°C, dark) with mono- 
clonal anti-human CD45 V450 (HI30, BD Biosciences), CD19 APC (HIB19, BD 
Biosciences), CD33 PE (WM53, BD Biosciences), HLA-ABC APC-Cy7 (W6/32, 
BioLegend), anti-mouse CD45.1 PE-Cy7 (A20, eBioScience), anti-mouse PE-Cy5 
mTer119 (TER-119, eBioscience) antibodies. Normal multi-lineage engraftment 
was defined by the presence of myeloid cells (CD33*) and B-cells (CD19*) within 
engrafted human CD45* HLA-ABC* cells. Parts of the mouse bone marrow 
were used for CD34-enrichment (MACS CD34 MicroBead Kit UltraPure, human, 
Miltenyi Biotec) and the presence of human HSPCs was assessed by staining 
with anti-human CD34 APC (8G12, BD Biosciences), CD38 PE-Cy7 (HB7, BD 
Biosciences), CD10 APC-Cy7 (HI110a, BioLegend), and anti-mouse CD45.1 
PE-Cy5 (A20, eBioScience) and analysed by flow cytometry. The estimation of 
the total number of modified human cells in the bone marrow at week 16 after 
transplant was calculated by multiplying the percentage engraftment with the 
percentage GFP* cells among engrafted cells. This number was multiplied by 
the total number of MNCs in the bone marrow of a NSG mouse (1.1 x 10° per 
mouse) to give the total number of GFP* human cells in the total bone marrow 
of the transplanted mice. The total number of MNCs in the bone marrow of a 
NSG mouse was calculated by counting the total number of MNCs in one femur 
in four NSG mice. The total number of MNCs in one mouse was then calculated 
assuming one femur is 6.1% of the total marrow as found previously**. 
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Differentiation of CD34+ HSPCs into erythrocytes in vitro. SCD patient-derived 
HSPCs were cultured in three phases following targeting at 37°C and 5% CO in 
SFEM II media according to previously established protocols**“°. Media was sup- 
plemented with 100U ml penicillin/streptomycin, 2mM t-glutamine, 401g ml“! 
lipids, 100 ng ml! SCE, 10 ng ml! IL-3 (PeproTech), 0.5U ml! erythropoietin 
(eBiosciences), and 200j1g ml! transferrin (Sigma Aldrich). In the first phase, 
corresponding to days 0-7 (day 0 being day 4 after electroporation), cells were 
cultured at 10° cells ml~'. In the second phase, corresponding to days 7-11, cells 
were maintained at 10° cells ml~! and erythropoietin was increased to 3U ml"1. 
In the third and final phase, days 11-21, cells were cultured at 10° cells ml~! 
with 3U ml"! of erythropoietin and 1 mg ml"! of transferrin. Erythrocyte 
differentiation of edited and non-edited HSPCs was assessed by flow cytometry 
using the following antibodies: hCD45 V450 (HI30, BD Biosciences), CD34 FITC 
(8G12, BD Biosciences), CD71 PE-Cy7 (OKT9, Affymetrix), and CD235a PE 
(GPA) (GA-R2, BD Biosciences). 

Assessment of mRNA levels in differentiated erythrocytes. RNA was extracted 
from 100,000-250,000 differentiated erythrocytes between days 16-21 of erythroid 
differentiation using the RNeasy Mini Kit (Qiagen) and was DNase-treated with 
RNase-Free DNase Set (Qiagen). CDNA was made from 100 ng RNA using the 
iScript Reverse Transcription Supermix for RT-qPCR (Bio-Rad). Levels of HbS, 
HbA (from corrective SNP donor), and HbA-AS3 (anti-sickling HBB cDNA 
donor) were quantified by qPCR using the following primers and FAM/ZEN/ 
IBFQ-labelled hydrolysis probes purchased as custom-designed PrimeTime qPCR 
Assays from IDT: HbS primer (fw): 5‘-TCACTAGCAACCTCAAACAGAC-3’, 
HbS primer (rv): 5’/-ATCCACGTTCACCTTGCC-3’, HbS probe: 5’-TAACG 
GCAGACTTCTCCACAGGAGTCA-3’, HbA primer (fw): 5’-TCACTAGCAACCT 
CAAACAGAC-3’, HbA primer (rv): 5’/-ATCCACGTTCACCTTGCC-3’, HbA 
probe: 5‘-TGACTGCGGATTTTTCCTCAGGAGTCA-3’, HbAS3 primer fw: 
5'-GTGTATCCCTGGACACAAAGAT-3’, HbAS3 primer (rv): 5’/-GGGC 
TTTGACTTTGGGATTTC-3’, HbAS3 probe: 5’-TTCGAAAGCTTCGGC 
GACCTCA-3’. Primers for HbA and HbS are identical, but probes differ by 
six nucleotides, and therefore it was experimentally confirmed that these two 
assays do not cross-react with targets. To normalize for RNA input, levels of the 
reference gene RPLPO was determined in each sample using the IDT predesigned 
RPLPO assay (Hs.PT.58.20222060). qPCR reactions were carried out on a 
LightCycler 480 II (Roche) using the SsoAdvanced Universal Probes Supermix 
(BioRad) following manufacturer’s protocol and PCR conditions of 10 min at 
95°C, 50 cycles of 15s at 95°C and 60s at 58°C. Relative mRNA levels were deter- 
mined using the relative standard curve method, in which a standard curve for 
each gene was made from serial dilutions of the cDNA. The standard curve was 
used to calculate relative amounts of target mRNA in the samples relative to levels 
of RPLPO. 

Data availability. The authors declare that the data supporting the findings of this 
study are available within the paper. 
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Extended Data Figure 1 | High tropism of rAAV6 for CD34+ HSPCs, 
and viability and specificity assessment of gene editing in CD34+ 
HSPCs. a, CD34* HSPCs were transduced with a scAAV6 expressing 
GFP from an SFFV promoter at multiplicities of infections (MOIs) of 
10,000 or 100,000 viral genomes (vg) per cell for 48 h and then analysed for 
percentage GFP* expression by flow cytometry using a non-transduced 
sample to set the GFP* gate at <0.1% GFP* cells. scAAV was used 
because it eliminates second-strand synthesis as a confounder of actual 
transduction. Results are from two independent experiments from at least 
two donors and error bars represent s.d. ABM, adult bone marrow; CB, 
cord blood; mPB: mobilized peripheral blood. b, CD34* HSPCs were 
electroporated with the HBB CRISPR system (mRNA or RNP delivery) 

or without (AAV only), and then transduced with HBB rAAV6 donor 
vectors at an MOI of 100,000 vg per cell. Day 4 after electroporation, 

cells were analysed by flow cytometry and live cells were gated in high 
forward scatter (FSC) and low side scatter (SSC). Percentage of cells in 
FSC/SSC gate (that is, percentage viability) is shown relative to that of 
mock-electroporated cells. Each data point represents a unique CD34 
HSPC donor. c, Top, sgRNA target sequences at the HBB on-target site 
and a highly complementary off-target site (Chr9:101833584-101833606) 
are shown. PAM sequences are underlined and red sequence highlights 


the three mismatches of the off-target site. Bottom, HSPCs were 
electroporated with either the mRNA or RNP-based CRISPR system, and 
4 days post electroporation genomic DNA was extracted and analysed 
for INDEL frequencies using TIDE at the on-target HBB and the off- 
target site. Results are shown as the ratio of on- to off-target activity 
highlighting the increased specificity of the RNP system. Averages from 
three different CD34* HSPC donors are shown and error bars represent 
s.e.m. **P< 0.01, unpaired Student’s t-test. d, INDEL frequencies for the 
data presented in c. *P < 0.05, paired Student’s t-test. e, Representative 
FACS plots showing stable GFP rates at day 18 after electroporation in 
donor-nuclease mismatch experiments. Mismatching nuclease and donor 
(red box) leads to infrequent end-capture events compared to on-target 
homologous recombination events observed with matched nuclease 

and homologous rAAV6 donor (green box). HSPCs were electroporated 
with 15 |1g Cas9 mRNA and either HBB or IL2RG 2'-O-methyl-3/- 
phosphorothioate-modified sgRNA, then transduced with HBB-GFP 
rAAV6 donor followed by 18 days of culture. f, End-capture experiments 
were performed in three replicate experiments each in three unique 
CD34+ HSPC donors. ns (not significant) = P > 0.05, paired Student's 
t-test. Activity of the IL2RG CRISPR was confirmed by quantification of 
INDELs at the IL2RG target site using TIDE analysis. 
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TCAAACAGACACCATGGTGCACCTGACTCCTGAGGAGAAGTCTGCCGTTACTGCCCTGTGGGGCAAGGTGAACGTGGATGAAGTTGGTGGTGAGGCCCT 
TCAAACAGACACCATGGTGCACCTGACTCCTGAGGAGAAGTCTGCCGTTA- --CCCTGTGGGGCAAGGTGAACGTGGATGAAGTTGGTGGTGAGGCCCT -3 
TCAAACAGACACCATGGTGCACCTGACTCCTGAGGAGAAGTCTGCC---- -CTGTGGGGCAAGGTGAACGTGGATGAAGTTGGTGGTGAGGCCCT -9 
TCAAACAGACACCATGGTGCACCTGACTCCTGAGGAGAAGTCTGCCGTTA ---~CCCTGTGGGGCAAGGTGAACGTGGATGAAGTTGGTGGTGAGGCCCT -3 
TCAAACAGACACCATGGTGCACCTGACTCCTGAGGAGAAGTCTGCCGT: GCCCTGTGGGGCAAGGTGAACGTGGATGAAGTTGGTGGTGAGGCCCT -4 
TCAAACAGACACCATGGTGCACCTGACTCCTGAGGAGAAGTCTGCCGTT: GCCCTGTGGGGCAAGGTGAACGTGGATGAAGTTGGTGGTGAGGCCCT -3 
TCAAACAGACACCATGGTGCACCTGACTCCTGAGGAGAAGTCTGC- --- -CCCTGTGGGGCAAGGTGAACGTGGATGAAGTTGGTGGTGAGGCCCT -8 
TCAAACAGACACCATGGTGCACCTGACTCCTGAGGAGAAGTCTGCCGTTA-~----------- CAAGGTGAACGTGGATGAAGTTGGTGGTGAGGCCCT -13 
TCAAACAGACACCATGGTGCACCTGACTCCTGAGGAGAAGTCTGCCGTTA-TGCCCTGTGGGGCAAGGTGAACGTGGATGAAGTTGGTGGTGAGGCCCT -1 
TCAAACAGACACCATGGTGCACCTGACTCCTGAGGAGAAGTCTGCCGT -ACTGCCCTGTGGGGCAAGGTGAACGTGGATGAAGTTGGTGGTGAGGCCCT -1 
TCAAACAGACACCATGGTGCACCTGACTCCTGAGGAGAAGTCTGCCGT GGGGCAAGGTGAACGTGGATGAAGTTGGTGGTGAGGCCCT -11 
TCAAACAGACACCATGGTGCACCTGACTCCTGAGGAGAAGTCTGCC- -CTGTGGGGCAAGGTGAACGTGGATGAAGTTGGTGGTGAGGCCCT -9 
TCAAACAGACACCATGGTGCACCTGACTCCTGAGGAGAAGTCT- --- -TGCCCTGTGGGGCAAGGTGAACGTGGATGAAGTTGGTGGTGAGGCCCT -8 
TCAAACAGACACCATGGTGCACCTGACTCCTGAGGAGAAGTCTGCCGTTA--GCCCTGTGGGGCAAGGTGAACGTGGATGAAGTTGGTGGTGAGGCCCT -2 
TCAAACAGACACCATGGTGCACCTGACTCCTGAGGAGAAGTCTGCCGTTA------' ‘TGTGGGGCAAGGTGAACGTGGATGAAGTTGGTGGTGAGGCCCT -6 
TCAAACAGACACCATGGTGCACCTGACTCCTGAGGAGAAGTCAA- ~~~ ~~~ TGCCCTGTGGGGCAAGGTGAACGTGGATGAAGTTGGTGGTGAGGCCCT -9/+2 
TCAAACAGACACCATGGTGCACCTGACTCCTGAGGAGAAGTCTGCCGTT-CTGCCCTGTGGGGCAAGGTGAACGTGGATGAAGTTGGTGGTGAGGCCCT -1 
TCAAACAGACACCATGGTGCACCTGACTCCTGAGGAGAAGTCTGC--~ ~~~ ‘TGCCCTGTGGGGCAAGGTGAACGTGGATGAAGTTGGTGGTGAGGCCCT -6 
TCAAACAGACACCATGGTGCACCTGACTCCTGAGGAGAAGTCTGCCGG- -CTGCCCTGTGGGGCAAGGTGAACGTGGATGAAGTTGGTGGTGAGGCCCT -3/+1 


Extended Data Figure 2 | Schematic of targeting rAAV6 Glu6Val performed. First, a 3,400-bp in-out PCR (green) was performed followed 
homologous donor to the HBB locus. a, The human HBB locus on by a nested 685-bp PCR (purple) on a gel-purified fragment from the first 
chromosome 11 is depicted at the top of the schematic and consists PCR. This second PCR fragment was cloned into TOPO vectors, which 
of three exons (black boxes) and two introns. The rAAV6 Glu6Val were sequenced to determine the allele genotype (wild type, INDEL or 
donor includes the Glu to Val mutation at codon 6, which is the amino homologous recombination). b, The sequence of a wild-type HBB allele 
acid change causing SCD. Other SNPs (all SNPs are capitalized) were aligned with the sequence of an allele that has undergone homologous 
introduced to PAM site (blue) and sgRNA-binding site (bold) to recombination. c, Representative INDELs from the data represented in 
prevent re-cutting following homologous recombination in HSPCs. Fig. 1d. The HBB reference sequence is shown in green. 


To analyse targeted integration frequencies in HSPCs, a two-step PCR was 
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Extended Data Figure 3 | Linear regression model shows that the day 

4 GFPti8" population is a reliable predictor of targeting frequencies. 
Day 4 GFP" #" percentages (x axis) were plotted against day 18 total GFPt 
percentages (y axis), and linear regression was performed. Data were 
generated from experiments including a total of 38 different CD34* HSPC 
donors, treated with either 151g or 30 jg Cas9 RNP to generate data points 
with a wider distribution of targeting frequencies. 
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Extended Data Figure 4 | Overview of PCR genotyping of 
methylcellulose colonies with homologous recombination of the GFP 
and tNGFR donor at the HBB locus. a, The HBB locus was targeted by 
creating a DSB in exon 1 via Cas9 (scissors) and supplying a rAAV6 GFP 
donor template. Alleles with integrations were identified by PCR (red, 
881 bp) on methylcellulose-derived colonies using an in-out primer set. 
Wild-type alleles were identified by PCR (green, 685 bp) using primers 
flanking the sgRNA target site. b, Representative genotyping PCRs 
showing mono- and biallelic clones as well as a clone derived from 
mock-treated cells. NTC, non-template control (see Supplementary 

Fig. 1a for uncropped gel). c, Representative Sanger sequence 
chromatograms for junctions between right homology arm (blue) and 


Mulan hi 


Genomiclocus 5’ homology arm jomology arm ~~ Insert 


insert (green) or genomic locus, highlighting seamless homologous 
recombination. d, The HBB locus was targeted by creating a DSB 

in exon 1 via Cas9 (scissors) and supplying a rAAV6 tNGFR donor 
template. Genotypes were assessed by a three-primer genotyping PCR on 
methylcellulose-derived colonies using an in-out primer set (red, 793 bp) 
and a primer set flanking the sgRNA target site (green, 685 bp). Note that 
the two forward primers are the same. e, Representative genotyping 
PCRs showing a wild-type/unknown, mono-, and biallelic clone 

(see Supplementary Fig. 1b for uncropped gel). f, Representative Sanger 
sequence chromatograms for junctions between left homology arm 

(in blue) and insert (in green) or genomic locus highlighting seamless 
homologous recombination. 
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Extended Data Figure 5 | See next page for caption. 
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Extended Data Figure 5 | Haematopoietic progenitor CFU assay 

and targeting in different HSPC subpopulations. a, GFP" HSPCs 
were single-cell-sorted into 96-well plates containing methylcellulose. 
Representative images from fluorescence microscopy show lineage- 
restricted progenitors (BFU-E, CFU-E, CFU-GM) and multipotent 
progenitors (CFU-GEMM) with GFP expression. b, CFUs were counted 
14 days after sorting and shown relative to the total number of cells 
sorted (percentage colony formation) (n= 2 different HSPC donors). 

c, Colonies were scored according to their morphology: (1) CFU- 
erythroid (CFU-E); (2) burst forming unit-erythroid (BFU-E); 

(3) CFU-granulocyte/macrophage (CFU-GM); and (4) CFU-granulocyte/ 
erythrocyte/macrophage/megakaryocyte (CFU-GEMM) (n=2 different 
HSPC donors). d, Representative FACS plots at day 4 after electroporation 
of CD34* HSPCs showing the gating scheme for analysing targeting 
frequencies in different HSPC subsets (Extended Data Fig. 5f). Cells were 
immunophenotyped for CD34, CD38, CD90 and CD45RA expression 
and relevant FACS gates are indicated. e, Representative FACS plots 
showing GFP"! cells in the CD34* CD38~ CD90* CD45RA~ population 
of HSPCs derived from mobilized peripheral blood, bone marrow, or 
cord blood. f, 500,000 HSPCs isolated from mobilized peripheral blood, 
adult bone marrow, or cord blood were electroporated with RNP and 
transduced with GFP rAAV6 donor. At day 4 after electroporation, cells 
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were phenotyped by flow cytometry for the cell surface markers CD34, 
CD38, CD90 and CD45RA (Extended Data Fig. 5d, e). Percentage GFPh8 
cells in the indicated subpopulations are shown (data points represent 
unique donors, n =3 per HSPC source), ****P < 0.0001, paired Student’s 
t-test. g, CD34* or CD34+ CD38~ CD90°* cells were sorted directly from 
freshly isolated cord blood CD34* HSPCs, cultured overnight, and then 
electroporated with RNP and transduced with GFP rAAV6. Bars show 
average percentage GFP* cells at day 18 after electroporation. (n= 3 

from different HSPC donors), **P < 0.01, paired Student’s t-test. h, 
Multipotent progenitor (MPP) (CD34+ CD38” CD90~ CD45RA_) and 
HSC (CD34t CD38~ CD90* CD45RA -) populations were sorted from 
fresh cord-blood-derived CD34* HSPCs and immediately after sorting, 
cells were transduced with scAAV6-SFFV-GFP at an MOI of 100,000 vg 
per cell along the bulk HSPC population. scAAV6 was used because it 
eliminates second-strand synthesis as a confounder of actual transduction, 
although the activity of the SFFV promoter may not be equivalent in each 
population, thus potentially underestimating the degree of transduction of 
MPPs and HSCs. Two days later, transduction efficiencies were measured 
by FACS analysis of GFP expression using non-transduced cells (mock) to 
set the GFP* gate. Error bars represent s.e.m., n = 4, two different HSPC 
donors. **P < 0.01; NS, not significant = P > 0.05, unpaired t-test with 
Welch's correction. 
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Extended Data Figure 6 | Analysis of human engraftment. 

a, Representative FACS plot from the analysis of the bone marrow of a 
control mouse not transplanted with human cells. Mice were euthanized 
and bone marrow was collected from femur, tibia, hips, humerus, sternum 
and vertebrae. Cells were subject to Ficoll density gradient to isolate 
mononuclear cells, which were analysed for human engraftment by flow 
cytometry. Human engraftment was delineated as hCD45/HLA-ABC 
double positive. From a total of 157,898 cells, 4 were found within the 
human cell gate of a non-injected control mouse, showing the very limited 
background human staining. b, Representative FACS plots showing gating 
scheme for analyses of NSG mice transplanted with human cells and 
analysed as described in a. Representative plots are from one mouse from 
the RNP plus rAAV6 experimental group. As above, human engraftment 
was delineated as hCD45/HLA-ABC double positive. B cells were marked 
by CD19 expression, and myeloid cells identified by CD33 expression. 
GFP expression was analysed in total human cells (2.4%), B cells (1.9%) 
and myeloid cells (2.8%). The GFP brightness in B cells is lower than in 


Total live cells injected 
per mouse (average 
per mouse) 


Average total HSCs 
injected per mouse 
(CD34*/CD38- 
/CD90*/CD45RA‘) 


Estimated total modified 
human cells in BM per 
mouse at Week 16 


myeloid cells, suggesting that the SFFV promoter is not as active in 

the B-cell lineage compared to the myeloid lineage (see also Fig. 3a). 

c, Overview of engraftment for RNP plus AAV and RNP plus AAV GFPHish 
experimental groups. Average engraftment frequencies and percentage 
GFP* human cells + s.e.m. are shown. Total number of cells transplanted 
was the same (500,000) for all mice in the RNP group, whereas in the 
GFP"8" group, one mouse was transplanted with 100,000 cells, two mice 
with 250,000 cells, and three mice with 500,000 cells. The total number 

of HSCs transplanted per mouse (-£s.e.m.) was calculated based on the 
frequencies of GFP* cells in the CD34* CD38~ CD90* CD45RA™ subset 
analysed by flow cytometry (see Extended Data Fig. 5f) directly before 
injection. The total number of modified human cells in the bone marrow 
at week 16 after transplant per mouse (+s.e.m.) was estimated based on 
calculations presented in the Methods. This shows that the enrichment 
not only resulted in a higher percentage of edited cells (column 3) but also 
resulted in an absolute higher number (column 6) of edited cells. 
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Extended Data Figure 7 | Genome-edited human HSPCs in the 
bone marrow of NSG mice at week 16 after transplantation. 
a, b, Representative FACS plots from the analysis of NSG mice from 
the mock (a) or RNP plus AAV (b) experimental group at week 16 after 
transplantation. Mice were euthanized and bone marrow was obtained, 
MNCs were isolated via Ficoll density gradient, after which human 
CD34? cells were enriched by magnetic-activated cell sorting (MACS), 
and finally cells were stained with anti-CD34, anti-CD38 and anti-CD10 
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antibodies to identify human GFP* cells in the CD34* CD10~ and 
CD34* CD10~ CD38" populations (note that CD10 was included as a 
negative discriminator for immature B cells). c, Collective data from the 
analysis of GFP* cells in the human CD34* CD10~ population from the 
RNP plus AAV (n= 11) and RNP plus AAV GEP"'8" (n = 6) experimental 
groups. For the RNP plus AAV GFP" 8 group, cells from all six mice were 
pooled before analysis and thus, no error bar is available. Error bar on 
RNP group represents s.e.m. 
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Extended Data Figure 8 | Correction of the sickle cell mutation in 
patient-derived CD34* HSPCs. a, Schematic overview of the sequence 

of the sickle cell allele aligned with the sequence of an allele that has 
undergone homologous recombination using the corrective SNP donor. 
The Glu6Val mutation in patients with SCD (A>T) is highlighted in 
yellow. The sgRNA recognition sequence, the PAM site and the cut site 
(scissors) are shown. The donor carries synonymous nucleotide changes 
between the sickle nucleotide and the cut site to avoid premature crossover 


during homologous recombination. Synonymous changes are also added 
to the PAM and an early nucleotide in the sgRNA target site to avoid 
subsequent re-cutting and potential inactivation of the corrected allele. 

b, HSPCs from two different patients with SCD were targeted with the 
corrective SNP donor and seeded in methylcellulose. After 14 days, in-out 
PCR amplicons from a total of 38 clones were sequenced and genotypes 
were extracted from sequence chromatograms. 
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Extended Data Figure 9 | Targeting HBB with a cDNA donor anda which constitutes a clinically compatible expression cassette enabling 


tNGER expression cassette. a, Schematic representation of the AAV6 enrichment and tracking of modified cells. b, Chromatogram from 
donor encoding a functional HBB anti-sickling cDNA (Gly16Asp, sequencing of an in-out PCR amplicon on CD34* cells derived from 
Glu22Ala, Thr87Gln) followed by an expression cassette for tNGFR. patients with SCD, electroporated with HBB Cas9 RNP and transduced 
The left homology arm stops just before the sickle mutation (AT) with rAAV6 HBB cDNA donor. PCR was performed on genomic DNA 
followed by the remaining HBB cDNA, which has been diverged from the extracted 4 days after electroporation of a bulk sample. Chromatogram 
endogenous sequence by introducing synonymous mutations at codon shows the sequence of the full HBB cDNA with start codon, sickle-cell 
wobble positions. The HBB cDNA expression cassette is followed by an codon position (containing a corrected Glu codon), and stop codon 
EF1la promoter driving tNGFR expression (HBB cDNA-EFla-tNGEFR), highlighted in red, green and blue boxes, respectively. 
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Extended Data Figure 10 | Edited HSPCs from patients with SCD 
differentiate into erythrocytes that express glycophorin A. CD34* 
HSPCs derived from patients with SCD were edited with HBB Cas9 RNP 
and either the corrective SNP donor or the cDNA donor. Four days after 
electroporation, cells edited with the cDNA donor were sorted for tNGFRt 
cells. This population as well as the populations edited with the corrective 
SNP donor and mock-electroporated cells were subjected to a 21-day 
erythrocyte differentiation protocol, followed by staining for glycophorin 
A (GPA). All data points within experimental groups are derived from 
experiments in cells from different patients with SCD, n =3 (mock) and 
n=2 (SNP and cDNA donor). 
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Intermediary metabolism generates substrates for chromatin modification, enabling the potential coupling of metabolic 
and epigenetic states. Here we identify a network linking metabolic and epigenetic alterations that is central to oncogenic 
transformation downstream of the liver kinase B1 (LKB1, also known as STK11) tumour suppressor, an integrator of 
nutrient availability, metabolism and growth. By developing genetically engineered mouse models and primary pancreatic 
epithelial cells, and employing transcriptional, proteomics, and metabolic analyses, we find that oncogenic cooperation 
between LKBI loss and KRAS activation is fuelled by pronounced mTOR-dependent induction of the serine-glycine- 
one-carbon pathway coupled to S-adenosylmethionine generation. At the same time, DNA methyltransferases are 
upregulated, leading to elevation in DNA methylation with particular enrichment at retrotransposon elements associated 
with their transcriptional silencing. Correspondingly, LKB1 deficiency sensitizes cells and tumours to inhibition of serine 
biosynthesis and DNA methylation. Thus, we define a hypermetabolic state that incites changes in the epigenetic landscape 
to support tumorigenic growth of LKB1- mutant cells, while resulting in potential therapeutic vulnerabilities. 


Substrates and inhibitors of chromatin-modifying enzymes are gen- 
erated in intermediary metabolism, so changes in nutrient availability 
and utilization can influence epigenetic regulation”. Importantly, 
recent studies have indicated that the interplay between metabolism 
and epigenetics can serve as a programmed switch in cell states. For 
example, mouse embryonic stem cell differentiation is promoted by 
succinate-mediated inhibition of histone demethylases (HDMs) and 
TET DNA demethylases*, or by decreased S-adenosyl-methionine 
(SAM) levels leading to loss of histone H3K4 methylation’. Moreover, 
aberrant metabolic activity can produce pathological effects by altering 
chromatin regulation. Most notably, mutations in the genes encoding 
the isocitrate dehydrogenase (IDH)1 and IDH2 enzymes lead to the 
generation of 2-hydroxyglutarate, which inhibits HDMs and TETs and 
thereby alters DNA and histone methylation—changes that have been 
implicated in overriding cell differentiation and promoting tumorigenesis”. 
Whether this paradigm extends more generally to other oncogenic 
mutations remains unclear, and this question has implications for 
understanding cancer pathogenesis and developing improved treat- 
ments. Here, we demonstrate that dynamic exchange between metabo- 
lism and chromatin regulation contributes to pancreatic tumorigenesis 
driven by mutation of the LKB1 serine-threonine kinase. 

LKB1 is mutationally inactivated in a range of sporadic cancers, 
including pancreatic carcinomas®*. Additionally, germline mutations 
in LKB1 cause Peutz-Jeghers syndrome, which comprises gastrointesti- 
nal polyps and a high incidence of gastrointestinal tract carcinomas (for 
example, an approximately 100-fold increase in pancreatic cancer)". 
Cancers with LKB1 mutations tend to exhibit aggressive clinical fea- 
tures and different therapeutic sensitivity from cancers without these 


mutations’'"!*, LKB1 directly activates a family of 14 kinases related 
to AMP-activated protein kinase (AMPK), many of which are coupled 
to nutrient sensing and broadly reprogram cell metabolism!°. Thus, 
metabolic rewiring is thought to be a driver of tumorigenesis after LKB1 
loss. We now identify an LKB1-regulated program that links metabolic 
alterations to control of the epigenome and is involved in malignant 
growth. Our results provide evidence that coupled metabolic and epige- 
netic states have a more general role in cancer pathogenesis and suggest 
therapeutic strategies that could target these intersecting processes. 


Synergy between LKBI and KRAS mutations 

LKB] inactivation frequently coincides with mutations in the RAS-RAF 
pathway in human cancers and these genetic alterations cooperate 
to drive tumorigenesis in genetically engineered mouse models 
(GEMMs)*!!)1416, We examined the interactions between oncogenic 
KRAS®!” and deletion of LKB1 in adult pancreatic ducts using a 
tamoxifen-inducible GEMM (Extended Data Fig. 1a). The combined 
alterations resulted in pancreatic cancers by 20-25 weeks, whereas the 
individual mutations had no pathological effects at this age (Fig. la and 
Extended Data Fig. 1b). To investigate the mechanisms of tumorigenesis, 
we isolated primary pancreatic ductal epithelial cells from mice with 
conditional KRASC!”P and LKB1 alleles (n= 2 lines per genotype) and 
transduced them with adenoviruses expressing Cre and/or Flp recom- 
binase to generate KRASS!2P’+ LKB1~/~— and KRASS&!2P/+,LKB1—/— 
cells (K, Land KL cells, respectively) as well as wild-type parental lines 
(Extended Data Fig. 1c). Only KL cells were tumorigenic following 
injection into severe combined immunodeficient (SCID) mice or 
growth in soft agar, and tumorigenicity was blocked by restoration of 
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Figure 1 | LKB1 inactivation synergizes with KRAS°'”” to potentiate 
glycolysis, serine metabolism, and tumorigenesis. a, Representative 
pancreas histology of the indicated genotypes of mice at 20-25 weeks 
(n=4 per genotype). b, Subcutaneous tumour growth of KL cells 
expressing empty vector (KL) or LKB1 (rescue) (n =8 per group). 

c, d, Ductal cells tested for glucose uptake (c; n = 6, independent 
replicates), and lactate secretion (d; n=3). e, Oxygen consumption rates 


wild-type LKB1 (Fig. 1b and Extended Data Fig. 1d-g). In vitro, KL 
cells showed greater proliferation than K cells, and both showed greater 
proliferation than wild-type or L cells (Extended Data Fig. 1h-j). Thus, 
primary ductal cells provide a tractable in vitro system to study mecha- 
nisms of epithelial cell transformation arising from LKB1 inactivation. 

Focusing on the metabolic alterations provoked by loss of LKB1, we 
found that KL cells exhibited an approximately 30% increase in glucose 
uptake compared to K cells, and showed marked elevations in levels of 
the GLUT] transporter and ATP (Fig. 1c and Extended Data Fig. 1k, 1). 
Lactate levels were elevated in KL cells, whereas oxygen consumption 
and citrate levels were reduced (Fig. 1d, e and Extended Data Fig. 1m). 
Moreover, KL cells showed heightened sensitivity to acute glucose 
deprivation and to inhibition of glycolysis using the glucose analogue 
2-deoxyglucose, the pyruvate dehydrogenase kinase inhibitor dichlo- 
roacetate, or the lactate dehydrogenase inhibitor galloflavin (Extended 
Data Fig. In-q). Importantly, neither KRAS°”” nor LKB1 inactiva- 
tion alone promoted significant alterations in glucose metabolism 
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Figure 2 | Activation of de novo serine biosynthesis supports growth of 
LKB1-deficient cells. a, Serine biosynthesis pathway. Red, upregulated 
in KL cells. b, Serine pathway gene expression (n = 8 per genotype). 

c, Isotopomer abundance of [!°N]glutamine-derived M+1 serine and 
glycine (n = 3, biological replicates). d, e, Three-day growth of ductal 
cells cultured with or without 0.4 mM serine (d; n = 20) or transduced 
with the indicated shRNAs (e; n = 6). f, Six-day proliferation of KL cells 
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ctor 


in K and KL cells under nutrient-replete conditions (n= 21). f, Fates of 
glycolytic intermediates. g, GSEA showing enrichment of serine-glycine- 
one-carbon network'® (K cells, n = 3; KL cells, n = 4). NES, normalized 
enrichment score. h, i, Isotopomer abundance of U['*C]glucose-derived 
M-+3 pyruvate (h) or serine (i; n = 3, biological replicates). Data pooled 
from three (e) or representative of two (d) experiments. Error bars, 

s.e.m. (b), s.d. (¢, d, g, h). *P < 0.05, **P < 0.01, ***P < 0.001. 


(Extended Data Fig. 1r-t). Thus, these genetic lesions acted synergis- 
tically to potentiate glycolysis while rendering cells highly dependent 
on glucose availability. 

These data suggested that an increased supply of glycolytic inter- 
mediates was available for anabolic processes to support the growth 
of KL cells (Fig. 1f). Notably, gene set enrichment analysis (GSEA) of 
RNA-sequencing (RNA-seq) and quantitative proteomics'” data indi- 
cated that KL cells are enriched for glycolytic enzymes and for net- 
works that connect glycolytic intermediates to one-carbon metabolism, 
with serine-glycine—-threonine and folate metabolism scoring highly 
among the induced pathways (Supplementary Data Table 1). There 
was particularly striking enrichment of a 64-gene signature defining 
the entire serine-glycine-one carbon (SGOC) network", indicating 
strong coordinate activation of these pathways (Fig. 1g and Extended 
Data Fig. 2a). Accordingly, the use of uniformly carbon-13-labelled 
glucose (UL? C]glucose) demonstrated that KL cells show augmented 
production of glucose-derived pyruvate and lactate and an even more 
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transduced with the indicated shRNAs and expression constructs (n = 3). 
g, Subcutaneous growth of tumours from KL cells transduced with the 
indicated shRNAs (n= 12 tumours per group). h, Proportion of CK19T 
tumours cells that are PCNA* (shControl n= 4, shPSAT1-1 n=4, 
shPSAT1-2 n = 3, representative tumours). Data pooled from four (b) or 
representative of two (e, f) or four (d) experiments. Error bars: s.d. (b-f), 
s.e.m. (g, h). *P < 0.05, **P< 0.01, ***P < 0.001. 
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Figure 3 | Activation of the SGOC network promotes DNA methylation 
in LKB1-deficient cells. a, SGOC network. b, SAM levels in ductal cells 
transduced with the indicated shRNAs (n= 4). c, Three-day growth of 

KL cells with or without SAM supplementation or PSAT1 knockdown 
(n= 16). d, Proliferation of ductal cells treated with 3-deazaadenosine 


pronounced increase in serine and glycine biosynthesis rates without 
changes in total levels of these amino acids (Fig. 1h, i and Extended 
Data Fig. 2b-d). 


Serine pathway dependence of KL cells 

Multiple serine pathway enzymes (PSAT1, PSPH, SHMT1 and 
SHMT2) were upregulated in KL cells, whereas restoration of LKB1 
expression reversed these changes, broadly suppressed the entire SGOC 
network, and reduced serine biosynthesis (Fig. 2a, b and Extended 
Data Fig. 2e-g). PSAT1 catalyses the transamination of 3-phospho- 
hydroxy-pyruvate (3PHP) to 3-phosphoserine (3PS), with glutamate as 
the nitrogen donor and a-ketoglutarate (a-KG) as a secondary prod- 
uct (Fig. 2a). Consistent with elevated PSAT1 activity, [!°N]glutamine 
labelling revealed a marked increase in nitrogen incorporation into 
serine and glycine in KL cells (Fig. 2c). Thus, LKB1 restricts serine 
metabolism. 

Consistent with this finding, KL cells were unaffected by culturing 
in serine-free medium, whereas K cells showed a roughly 40% decrease 
in proliferation, as did KL cells in which LKB1 was expressed (Fig. 2d 
and Extended Data Fig. 3a). On the other hand, KL cells were specifi- 
cally sensitive to PSAT1 knockdown, exhibiting reduced proliferation 
in normal medium, restored dependency on exogenous serine, and 
impaired colony formation in soft agar (Fig. 2e and Extended Data 
Fig. 3b-d). Introduction of short hairpin (sh)RNA-resistant human 
PSAT1 cDNA rescued these phenotypes (Fig. 2f and Extended Data 
Fig. 3e). Moreover, PSAT1 knockdown strongly inhibited subcutaneous 
tumour growth by KL cells (Fig. 2g, h and Extended Data Fig. 3f, g). 
Notably, PSAT1 knockdown had no effect on the proliferation or tum- 
origenicity of cell lines from pancreatic cancer GEMMs with wild- 
type LKB1 (KRAS©!?-p53 null, KPC and KRAS°”?-CDKN2A null, 
KIC)’, supporting the idea that these effects result from LKB1 loss 
rather than being secondary to high proliferation rates (Extended Data 
Fig. 3b, h-k). Thus, increased serine biosynthesis pathway activity is 
required to drive oncogenic transformation in KL cells. 


Serine biosynthesis fuels DNA methylation 

The serine biosynthesis pathway can fuel a range of anabolic pro- 
cesses, including generation of N°,N'°-methylene-tetrahydrofolate 
(THF), which supports NADPH production, redox homeostasis, and 
nucleotide biosynthesis”°. PSAT1 knockdown failed to increase levels 
of reactive oxygen species (ROS) in KL cells and, correspondingly, 
proliferation was not restored by the ROS scavenger N-acetylcysteine 
(NAC) or by nucleoside supplementation (Extended Data Fig. 4a—d). 


(3-DZA) (n= 12). e, Heatmap of differentially regulated SAM-using 
enzymes (proteomics). f, 5mC in ductal cells (77-177 cells). Data pooled 
from two (c, d, f) or representative of two (b) experiments. Error bars: 
s.d. (b-d), s.e.m. (f). *P< 0.05, **P< 0.01, ***P< 0.001. 


Similar results were obtained after treatment with aminooxyacetate 
(AOA), an inhibitor of PSAT1 and other aminotransferases (Extended 
Data Fig. 4e, f). Thus, the requirement of KL cells for PSAT1 appears 
to be independent of its roles in redox homeostasis or maintenance of 
nucleoside pools. 

Serine metabolism can also fuel the methionine salvage pathway 
(Fig. 3a), which is the main mechanism of production for the methyl 
donor S-adenosy! methionine (SAM)*!*. Notably, multiple enzymes 
that channel serine metabolism intermediates into the methionine 
salvage pathway and that contribute to SAM biosynthesis were upreg- 
ulated in KL cells (Extended Fig. 4g). Accordingly, PSAT1 knockdown 
reduced SAM levels in KL cells but not in K cells (Fig. 3b). Furthermore, 
SAM supplementation mitigated the proliferation defects caused by 
PSAT1 ablation in KL cells (Fig. 3c). Conversely, suppression of SAM 
biosynthesis by inhibition of S-adenosyl-homocysteine-hydrolase 
(SAH) with 3-deazaadenosine (3DZA), or of methionine-adenosyl- 
transferase (MAT) with cycloleucine, slowed the growth of KL cells, 
whereas K cells remained unaffected (Fig. 3d and Extended Data 
Fig. 4h). Thus, activation of de novo serine biosynthesis contributes to 
SAM production to support growth of LKB1-deficient cells, possibly 
by providing active methyl groups in the form of N°-methyl-THE or by 
promoting ATP generation (for example, via PSAT1-dependent TCA 
cycle anaplerosis)**”°. 

SAM is the substrate for methylation of lipids, DNA, RNA, 
metabolites and proteins. Examination of our RNA-seq and proteomics 
data sets for the expression of the 183 annotated SAM-dependent 
methyltransferase enzymes revealed significant upregulation of the 
DNA methyltransferases DNMT1 and DNMT3A in KL cells, whereas 
few of the other enzymes showed any changes (Fig. 3e, Extended Data 
Fig. 5a and Supplementary Data Table 2). Quantitative PCR and immu- 
noblot analyses verified the regulation of expression of these enzymes 
by LKB1 (Extended Data Fig. 5b, c). Importantly, inhibition of DNA 
methylation using 5-aza-2-deoxycytidine (decitabine) or the non- 
nucleoside DNMT inhibitor RG108 promoted SAM accumulation in KL 
cells but not in K cells (Extended Data Fig. 5d), suggesting that DNMTs 
are major SAM-utilizing enzymes in the context of LKB1 deficiency. 

Accordingly, immunofluorescence and DNA dot blot analysis demon- 
strated that KL cells had a significant increase in 5-methylcytosine 
(5mC) as compared to K cells, an effect that was reversed by LKB1 
rescue (Extended Data Fig. 5e—g). By contrast, there were no consistent 
changes in global levels of a series of histone methylation marks, in 
5-hydroxy-methylcytosine, or in Ne-methyladenosine (Extended Data 
Fig. 5h-i and data not shown). Notably, PSAT1 knockdown suppressed 
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Figure 4 | The LKB1-PSAT1 pathway controls methylation and 
expression of retrotransposons. a, Distribution of methylation levels 

in ductal cells. Median values are indicated. b, Heatmap of differentially 
methylated tiles. c, d, Distribution of methylation levels in LINE] (c) and 
LTR (d) repeats. e, Expression of LINE1 in ductal cells, tested using two 
independent primer sets (n = 3). f, g, KL cells transduced with EV or LKB1 


DNA methylation in KL cells, and this effect was reversed by shRNA- 
resistant PSAT1 cDNA, while K and KPC controls were unaffected. 
(Fig. 3f and Extended Data Fig. 6a-c). Comparison of all four genotypes 
using our set of isogenic ductal cells confirmed that serine pathway 
activity and DNA methylation were specifically potentiated upon dual 
KRASS!” expression and LKB1 loss (Extended Data Fig. 6d-g). 

The expression of a kinase-dead (KD) LKB1 mutant confirmed that 
the activation of key transcriptional and metabolic circuits in KL cells 
was due to loss of LKB1 kinase activity (Extended Data Fig. 7a-e). 
Accordingly, KL cells showed reduced activity of AMPK, an important 
target of the LKB1 kinase, and consequent activation of mechanistic 
target of rapamycin complex I (mTORC1) (Extended Data Fig. 7f, w). 
Moreover, in K cells, AMPK suppression using the small molecule 
inhibitor dorsomorphin (compound C) or shRNAs targeting AMPKal 
or AMPKa2 increased each of the metabolic and gene expression 
parameters and upregulated global DNA methylation levels, mimicking 
the effects of LKB1 loss (Extended Data Fig. 7g—q). Additionally, 
AMPK silencing blocked the effects of LKB1 re-expression on the 
growth and metabolic properties of KL cells (Extended Data Fig. 7r-u). 
Finally, KL cells were hypersensitive to mTOR inhibition using Torin 1, 
which suppressed the metabolic and transcriptional changes seen in 
these cells (Extended Data Fig. 7v-z). Thus, deregulation of AMPK- 
mTORCI signalling contributes to the altered metabolic and epigenetic 
network in KL cells. 


LKB1 loss silences retrotransposons 

DNA methylation can contribute to transcriptional regulation and 
maintenance of genomic stability**, We mapped methylation changes 
by conducting whole-genome bisulphite sequencing (WGBS) of K 
and KL cells (two independently derived lines per genotype). The data 
demonstrated a marked increase in mean CpG methylation in KL 
cells (Fig. 4a and Extended Data Fig. 8a). Comparison of methylation 
levels at non-repetitive 100-bp tiles revealed 3,395 hypermethylated 
and 1,270 hypomethylated regions in KL cells (FDR < 0.05, methyl- 
ation difference >0.1; Fig. 4b and Extended Data Fig. 8b). However, 
the genes associated with differentially methylated regions had only 
modest overlap with those showing differential expression, suggesting 
that methylation of these elements may not prominently affect tran- 
scriptional regulation (Extended Data Fig. 8c). Thus, we focused on 
the repetitive portion of the genome. Notably, methylation in KL cells 
was particularly enriched at retrotransposon repeats (long terminal 
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assessed for methylation of LINE] and 18S (f; n= 6, LINE1-1 and 18S; 
n=4, LINE1-2), and LINE] expression (g; n= 6). h, i, KL cells transduced 
with the indicated shRNAs were assessed for LINE] and 18S methylation 
(h; n= 6) and LINE1 expression (i; n = 10). Data pooled from two (f, h) or 
representative of two (e, g, i) experiments. Error bars: s.d. *P < 0.05, 

** P< 0.01, ***P< 0.001. 


repeats (LTRs), long interspersed nuclear elements (LINEs) and short 
interspersed nuclear elements (SINEs)) compared to non-repeat ele- 
ments (promoters, CpG islands and shores) (Fig. 4c, d and Extended 
Data Fig. 8d-k). These elements comprise a large proportion of the 
genome, have promoters that can be actively transcribed when unmeth- 
ylated, and can influence regulation of host genes”. Consistent with 
a functional role for the observed retrotransposon methylation, KL 
cells had a > 50% reduction in LINE-1 expression compared to K cells 
(Fig. 4e). Moreover, restoration of LKB1 suppressed LINE-1 methyla- 
tion, as determined by 5mC-DNA immunoprecipitation—PCR analysis 
(meDIP-qPCR), and led to an approximately twofold upregulation of 
LINE-1 transcript expression (Fig. 4f, g). Importantly, PSAT1 knock- 
down also reduced LINE-1 methylation and induced its expression 
in KL cells (Fig. 4h, i), indicating that increased serine metabolism 
supports retrotransposon methylation and transcriptional silencing 
downstream of LKB1 loss. 

Because retrotransposons can function as important modulators 
of host gene expression”, we explored the correlation between dif- 
ferentially expressed genes in K versus KL cells and the presence of 
linked retrotransposons. Retrotransposon elements were significantly 
enriched (x? test, P< 0.0001) in the gene bodies of differentially 
expressed genes as compared to the distribution of these elements 
across all genes (Extended Data Fig. 81, m). By contrast, minimal 
enrichment was observed when promoter regions were considered. 
Such specificity is notable in light of evidence that intragenic methyl- 
ation (including repeat methylation) can modify the activity of linked 
promoters and affect RNA processing”*”°. Thus, the LKB1-SGOC 
pathway alters the epigenetic landscape and dynamically regulates 
retrotransposon methylation and transcriptional activity, changes that 
appear to be associated with differences in host gene transcription. 


KL cells are sensitive to DNMT inhibition 

Consistent with the functional relevance of increased DNA meth- 
ylation, KL cells were sensitive to shaRNA-mediated knockdown of 
DNMT1 or DNMT3A, whereas K or KPC cells remained largely unaf- 
fected (Extended Data Fig. 9a—c). Similar specific sensitivity of KL 
cells was observed in vivo using doxcycline (dox)-inducible shRNAs 
to acutely deplete DNMT1 or DNMT3 (or shGFP control) once subcu- 
taneous tumours reached 50 mm’ in size (Fig. 5a-c and Extended Data 
Fig. 9d-f). These data indicate that inhibition of DNA methylation is a 
potential vulnerability of KL cells. Indeed, KL cells were hypersensitive 
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Figure 5 | LKB1 deficiency confers hypersensitivity to DNA 
methylation inhibitors. a—c, Volume of subcutaneous tumours derived 
from KL cells transduced with Dox-inducible shRNAs (n = 6 tumours 

per condition). d, Decitabine ICs9 for K, KL or KPC cells (n= 4). 

e-g, Decitabine treatment of KL xenografts (n = 20 tumours per condition). 
e, Tumour growth. f, Proportion of CK19* tumour cells that are PCNAt 
(n=4 representative tumours), and ratio of cleaved-caspase 3 (CC3) to 
DAPI staining (n = 4 representative tumours). g, Haematoxylin and eosin 


to the clinically approved DNMT inhibitor decitabine, as compared 
to K and KPC cell lines (IC59 KL: 2nM; K: 138nM; KPC: 258 nM) 
(Fig. 5d and Extended Data Fig. 9g-i). Similar results were obtained 
using additional DNMT inhibitors (RG108, EGCG and SGI1027; 
Extended Data Fig. 9j-m). Importantly, decitabine treatment caused 
striking regression of subcutaneous KL tumours, associated with 
necrosis, hyalinization, replacement of tumour glands with fibrosis, 
decreased tumour cell proliferation, and pronounced apoptosis 
(Fig. 5e-g). By contrast, KPC xenografts were unaffected by decitabine 
(Extended Data Fig. 9n-q). Thus, LKB1 deficiency confers specific 
hypersensitivity to inhibition of DNA methylation in vitro and in vivo. 

Available human LKB1 mutant pancreatic cancer cell lines 
(COLO357 and SNU324) showed similar vulnerabilities. They exhib- 
ited 70-90% inhibition of proliferation following PSAT1 knockdown, 
whereas a set of LKB1 wild-type pancreatic cancer lines showed 
modest responses (Extended Data Fig. 10a), consistent with prior 
results”’ in LKB1 wild-type lines. PSAT1 inactivation also inhibited 
the growth of LKB1 mutant COLO357 xenografts but not LKB1 
wild-type MIAPACA2 xenografts (Extended Data Fig. 10b). Moreover, 
LKBI restoration or PSAT1 silencing decreased global methylation in 
COLO357 cells but not in PANC1 or PATU-8988T cells (Extended Data 
Fig. 10c, d). LKB1 mutant lines also exhibited an increased response 
to decitabine in vitro and in vivo, and to the methionine salvage path- 
way inhibitors 3DZA and cycloleucine (Fig. 5h, i and Extended Data 
Fig. 10e). Thus, both mouse and human LKB1 mutant pancreatic 
tumour cells are sensitized to inhibition of the serine pathway and to 
DNMT inhibition. 


Discussion 
In summary, we have defined a metabolic state central to tumorigenesis 
resulting from LKB1 loss in which glucose and glutamine-derived 


staining of representative tumours (top; arrows: tumour glands, dotted 
line: hyalinization), and staining for the indicated markers (quantified in f). 
Insets: threefold magnification. h, Decitabine ICs9 in human pancreatic 
cancer cell lines (red, LKB1 mutant; black, LKB1 wild type). i, Growth 

of COLO357 xenografts treated with decitabine (1 mg kg ') or vehicle 
(n=6 per group). Data pooled from three (h) or representative of two (d) 
experiments. Error bars: s.e.m. (a-c, i), s.d. (d-f). *P < 0.05, **P< 0.01, 
FEED < (0.001. 


intermediates are channelled towards the SGOC network leading to 
increased DNA methylation (Extended Data Fig. 10f). Such interplay 
between metabolic control and epigenetic reprogramming has been 
proposed as a key mechanism for cancer development in the context 
of mutations in metabolic enzymes!”». The present work provides 
evidence for a broader role of metabolic and epigenetic crosstalk in 
cancer pathogenesis, revealing that LKB1 mutant pancreatic cancer 
cells have a marked dependency on pathways linking glycolysis, serine 
metabolism and DNA methylation. 

It appears probable that the influence of aberrant metabolism on epige- 
netics contributes to cancer in the context of other oncogenic mutations. 
For instance, PI3K/AKT signalling controls acetyl-coenzyme A 
metabolism to support histone acetylation and regulate growth- 
promoting genes”*. Numerous oncogenic pathways rewire metabolism, 
and the circuits that are activated vary widely, with different genetic 
lesions favouring distinct fates of glucose, glutamine, fatty acids, and 
other nutrients. Thus, the resulting alterations in the levels of metabolites 
affecting chromatin regulation (for example, NAD*/NADH, FAD, 
O-linked N-acetylglucosamine, free fatty acids, SAM and acetyl-CoA) 
may be fundamental to cancer-promotion. Varying conditions in the 
tumour microenvironment may alter tumour phenotypes via similar 
processes”?*°, 

The gain in DNA methylation at retrotransposons that we observed 
in KL cells is notable in light of the emerging view that these abundant 
repeat elements serve critical roles in host gene regulation”’. Moreover, 
there is increasing evidence that reactivation of silenced retrotransposons 
may underlie the therapeutic benefit of DNMT inhibitors, serving to 
induce a type I interferon anti-viral defence program*!**. The basis for 
the widespread differences in response to DNMT inhibitors observed 
in vitro and in the clinic remains unclear. In this regard, our studies 
in KRAS mutant pancreatic ductal cells suggest that LKB1 status is a 
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genetic marker for DNMT inhibitor responsiveness. Because repeat 
element methylation does not strictly correlate with sensitivity to these 
treatments*’, the precise mechanisms of the specific response of KL cells 
to DNMT inhibitors remain to be defined. In this setting, it will be of 
interest to determine the relative contributions of IFN pathway activa- 
tion and of modulation of host gene expression. Together, our data point 
to novel therapeutic vulnerabilities in the context of LKB1 mutations, 
suggesting that it might be possible to use agents targeting nodes of this 
network in defined patient subsets, although additional studies will be 
needed to broadly establish these associations in human cancers. 

Our findings and other recent reports highlight the importance 
in cancer of the deregulation of serine biosynthesis, which can result 
from amplification of genes encoding pathway enzymes”*** or their 
transcriptional activation downstream of oncogenic drivers, including 
NRE2?°, c-MYC**38, and mTOR?’. Serine metabolism can fuel 
diverse biosynthetic pathways and contribute to energy production. 
Correspondingly, its functional role in aberrant growth appears context 
dependent, relating variously to supporting nucleotide biosynthesis, 
NADPH production, TCA cycle anaplerosis, and DNA or histone 
methylation. These findings underscore the emerging interest in devel- 
oping new approaches to targeting components of the SGOC network 
as cancer therapeutics. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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Extended Data Figure 1 | LKB1 suppresses KRAS©!?-driven 
tumorigenesis and limits glycolysis in primary pancreatic ductal 
epithelial cells. a, Schematic of GEM models. Sox9-Cre™®, LKB1", 
and LSL-KRASS!”/* mice were crossed to generate four cohorts: WT 
(Sox9-Cre®), L (Sox9-Cre®®;LKB1!"), K (Sox9-Cre®®;LSL-KRASCD!*) 
and KL (Sox9-Cre®®;LSL-KRASS!2P’/+;,LKB1!"), Genetic lesions were 
induced by intraperitoneal injections of tamoxifen at 6 weeks of age, 
after which mice were observed for signs of disease and killed when 
KL animals were moribund (20-25 weeks of age). The WT, K, and L 
mice had no signs of illness or other abnormalities at this time point. 
b, Haematoxylin and eosin-stained sections of representative pancreata 
from WT and L mice (n =4 mice per group). Scale bars, 50 jum. 
c, Schematic of primary pancreatic ductal epithelial cell system. 
Pancreatic ductal epithelial cells were isolated from LSL-KRASS?P!+ 
and LSL-KRASS!29/+ .LKB1"" mice and infected with Adeno-Cre 
to generate K and KL cells. For studies comparing K, L, KL and WT 
genotypes, cells were isolated from FSF-KRASC?)/*;LKB1!" mice and 
infected with Adeno-Flipase and/or Adeno-Cre, or neither. d, Volume 
and e, weight of subcutaneous tumours derived from ductal cells of the 
indicated genotypes (n = 4 tumours per group). Error bars show s.e.m. 
For source data on tumour volume, see Supplementary Data Table 4. 
f, Weight of subcutaneous tumours from K (n= 6), KL (n=8) and KL 
cells transduced with retrovirus expressing LKB1 cDNA (rescue, n= 8). 
g, Number of colonies formed in soft agar by K (none detected) or KL 
cells (n = 6 independent biological replicates). Error bars show s.e.m. 
h, Proliferation of K and KL cells in nutrient-replete medium (n = 4). 
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i, Proliferation of KL cells transduced with retroviruses expressing 

empty vector or LKB1 (rescue; n = 3). j, Proliferation of wild-type (WT), 
KRASS!2)/+ (K), LKB1~/~ (L) and KRAS°!?)’+ ;LKB1~/~ (KL) cells 
(n=6). k-t, In vitro studies of K and KL cells. k, Detection of GLUT1 
(SLC2A1) by immunofluorescence (scale bar, 201m) or immunoblot. 
2-(4-amidinophenyl)-1H-indole-6-carboxamidine (DAPI) was used to 
visualize nuclei. Actin was used as the loading control. For gel source 
images see Supplementary Data Fig. 1.1, Steady-state ATP levels under 
nutrient-replete conditions measured by CellTiterGlo (Promega), 
normalized to cell number and expressed as relative to ATP levels in 

K cells (n=4). m, Intracellular levels of pyruvate, lactate, and TCA cycle 
metabolites as detected by GC-MS. Values are normalized to cell number. 
Data are expressed as relative to the levels in K cells (n =6 biological 
replicates). n, Three-day proliferation of cells in 25 mM glucose or acutely 
switched to media with the indicated reduced glucose concentrations 

(n= 6). Data are expressed as relative to day 0. o-gq, Proliferation of cells 
treated with 5 mM 2-deoxyglucose (2DG) (0), 5 mM dichloroacetate 
(DCA) (p) or 20M galloflavin (gallo) (q). Values are expressed as 
percentage of normal growth (2DG n=6, DCA n=8, gallo n=6). 

r-t, WT, K, Land KL cells were measured for glucose uptake using 
2NBDG (r, n =6), lactate release into the medium (s, n = 4), and expression 
of glycolytic genes (t, n = 6). Data are pooled from two (j, 1, n-r) or three (t) 
experiments or representative of two (h, i, s) experiments. For all panels, 
error bars show s.d. unless otherwise stated; *P < 0.05, **P< 0.01, 

**EP < 0.001. 
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Extended Data Figure 2 | LKB1 loss induces the serine-glycine-one- 
carbon network. a, GSEA showing enrichment of proteins involved in 
the serine-glycine-one-carbon network’ in KL cells compared to K cells 
using global proteomics (n= 4 samples per group). b, Plot of isotopomer 
abundance of U[3C]glucose-derived intracellular M+3 lactate over time 
(n=3 independent biological replicates). c, Plot of isotopomer abundance 
of U[*C] glucose-derived intracellular M+2 glycine over time (n = 3 
independent biological replicates). d, Intracellular levels of serine, glycine 
and glutamine as detected by GC-MS. Values normalized to cell number. 
Data are expressed as relative to the levels in K cells (n =6 biological 
replicates). e, PSAT1 and GLDC expression determined by immunoblot 


in Kand KL cells and in KL cells transduced with LKB1 cDNA. Actin was 
used as loading control. For gel source images see Supplementary Data 
Fig. 1. f, GSEA of RNA-seq data showing suppression of genes involved in 
glycolysis, serine biosynthesis, folate cycle and the serine-glycine-one- 
carbon network upon re-expression of LKB1 cDNA in KL cells (rescue, 
n= 2 samples) compared to parental KL cells (n= 4 samples). g, Plot 

of isotopomer abundance of U['*C]glucose-derived intracellular M+3 
serine 6h after addition of U['*C] glucose (n = 3 independent biological 
replicates). For all panels, error bars show s.d. unless otherwise stated; 

*P < 0.05, **P< 0.01, ***P< 0.001. 
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Extended Data Figure 3 | See next page for caption. 
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Extended Data Figure 3 | The de novo serine biosynthesis pathway is 
required for KL but not KPC tumour growth. a, Proliferation of KL 
cells transduced with vector or LKB1 cDNA and cultured in the presence 
or absence of 0.4mM serine. Growth is expressed as relative to day 0 
(n=3). LKB1 re-expression slows growth and results in sensitivity to 
serine deprivation. b, (RT-PCR showing effective knockdown of PSAT1 
in K, KL, KPC and KIC cells transduced with shControl or two different 
shRNAs against PSAT1 (n= 2). Data are expressed as relative to shControl 
for each cell line. 18S rRNA was used for normalization. c, Number of 
colonies formed in soft agar by KL cells transduced with shControl or two 
different shRNAs against PSAT1 (mn =6). d, Proliferation of K or KL cells 
transduced with shControl or two independent shRNAs against PSAT1 
in the absence of serine. Data are expressed as percentage of growth in 
the presence of 0.4 mM serine (n =6). e, (RT-PCR showing effective 
knockdown of endogenous mouse PSAT1 (mPSAT1, endogenous) and 
forced expression of human PSAT1 (hPSAT1, exogenous) in KL cells 
transduced with shControl, shPSAT1-1, shPSAT1-2, vector or hPSAT1 
cDNA. Data are normalized to 18s rRNA (n= 2). f, Weight at the time 

of harvesting of subcutaneous tumours from KL cells transduced 

with shControl (n= 8), shPSAT1-1 (n= 12), or shSPAT1-2 (n= 12). 
Error bars show s.e.m. g, Haematoxylin and eosin-stained sections 

and immunofluorescence analysis of representative tumours derived 
from subcutaneous injections of KL cells transduced with shControl, 


shPSAT1-1, or shPSAT1-2. Note that PSAT1 knockdown tumours have 

a reduction in malignant glands (arrows) relative to the fibrotic stroma. 
Lower panels: anti-CK19 (green) was used to visualize the neoplastic 
epithelium and anti-PCNA (red) was used to mark proliferation. DAPI 
was used to stain nuclei (blue). h, Proliferation of KPC and KIC pancreatic 
cancer cells transduced with shControl or two independent shRNAs 
against PSAT1. Growth is expressed as relative to day 0 (n= 6 for KPC and 
n= 4 for KIC). i-k, KPC cells were transduced with shControl (n = 6), 
shPSAT1-1 (n =6) or shSPAT1-2 (n= 4) and injected into SCID mice. 

i, Volume (left) and weight (right) of subcutaneous tumours. Error bars 
show s.e.m. For source data on tumour volume, see Supplementary Data 
Table 4. j, k, Tumours in i were stained using anti-CK19 antibody (green) 
to visualize the neoplastic epithelium and anti-PCNA staining (red) 

to mark proliferating cells. DAPI was used to stain nuclei (blue). 

The proportion of stained CK19* cells is quantified in j (n= 4, 
representative tumours) and CK19* cells with nuclear PCNA staining 
are quantified in j. There are no significant effects on any of these 
parameters. k, Haematoxylin and eosin-stained sections (top) and 
immunofluorescence analysis (bottom) of representative tumours. Scale 
bars, 100m. Insets show threefold magnification. Data pooled from two 
(c, d) or representative of two (a, b, e) or three (h) experiments. For all 
panels, error bars show s.d. unless otherwise stated; *P < 0.05, 

** P< 0.01, ***P < 0.001. 
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Extended Data Figure 4 | Characterization of serine-glycine-one- 
carbon pathway in KL cells. a, Detailed graph of SGOC network. 
Enzymatic inhibitors used in this study are marked in red. b, ROS in K or 
KL cells transduced with shControl, shPSAT1-1 or shPSAT1-2 measured 
by DCFDA (left) and CellRox staining (right). Data are normalized to cell 
number (n= 4). c, Six-day proliferation assay of KL cells transduced with 
shControl, shPSAT 1-1 or shPSAT 1-2, showing lack of growth rescue by 
N-acetylcysteine (NAC). Data are expressed as relative to day 0 (n=6). 
d, Three-day growth assay of KL cells transduced with shControl, 
shPSAT1-1 or shPSAT 1-2, showing the lack of rescue by excess 
nucleosides (adenosine, guanosine, thymidine, uridine, cytidine; 1 mM 
each). Data are presented as percentage of the growth of shControl cells 


(n= 16). e, Proliferation of K or KL cells treated with aminooxyacetate 
(AOA). Data are expressed as relative to day 0 (n= 8). f, Five-day 
proliferation of K or KL cells treated with AOA and/or NAC. Data are 
expressed as relative to day 0 (n=6). g, Data from RNA-seq (left) and 
quantitative proteomics (right) showing levels of genes involved in 

the production of SAM in K and KL. The data plotted are expressed as 
mean-centred values. h, Proliferation of K or KL cells treated with 2 mM 
cycloleucine. Data are expressed as percentage of the growth of vehicle 
treated cells (n = 12). Data are pooled from two (b-f, h) experiments. 
For all panels, error bars show s.d. unless otherwise stated; *P < 0.05, 
**EP < 0.01, ***P < 0.001. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 


a b Cc K vs KL 
6 0 1.5 _ 100 
7] K KL rescue x) K cells KL cells 3 80 = 
d a E a s 
2 £05 o 
= ~0. 0 ri 
« a 
3 + 
S 
1) g $ x h 
22.0 aS 1.0 
ey Pn DNA (ng) & 3 ieee : ak 
2 =| ® — —e@ |H3K36me3 4 
oO 1.5 250 r ef = . - me. 9 : KL 
£ — em | H3K27me3 sO 
5 1.0 @ O\smc 8% Bs 
20.5 e 2 e 
3 62.5). 3 
0.0 © | total a0 0 
ty : se 2° ¢ 2 
wv < 
Rr a 
ee 
28 * . 
[7 
> 
26 
te. oO 
shmc = Eg 
wo 
oO 
22 
& 
total 2 
ty 


Extended Data Figure 5 | Deletion of LKB1 induces DNMT1 and 
DNMTS3A expression and increases global DNA methylation. a, Heat 
map of RNA-seq data showing levels of the differentially regulated SAM- 
using enzymes. Plotted data are expressed as mean centred values. 

b, Expression of DNMT1 and DNMT3A in K, KL, and KL cells transduced 
with LKB1 cDNA (rescue) was measured by qRT-PCR. Levels were 
normalized to 18S rRNA. Data are expressed as relative to K cells (n= 4, 
representative of two experiments). c, Immunoblots of lysates from K, 

KL or rescue cells were probed for DNMT1 or DNMT3A. Actin was 

used as loading control. For gel source images see Supplementary Data 
Fig. 1. d, Measurement of SAM in K and KL cells treated with 5-aza-2- 
deoxycytidine (decitabine) or RG108 for 3 days. In each case, data are 
expressed as relative to the amount of SAM in vehicle-treated K cells, 
which was arbitrarily set to 1 (n = 6 independent replicates). 

e, Immunofluorescence staining and quantitation of 5mC in K or KL cells 


(77-130 cells). Scale bar, 251m. f, Dot blot of DNA isolated from K or KL 
cells probed with anti-5mC antibody. Quantified signal was normalized 
to total DNA as measured by methylene blue staining (n = 4, independent 
replicates). g, Dot blot of DNA isolated from KL cells transduced with 
empty vector or LKB1 cDNA probed with anti-5mC antibody. Quantified 
signal was normalized to total DNA as measured by methylene blue 
staining (n = 3, independent replicates). h, Immunoblot analysis of 
histone 3 (H3) methyl marks from K or KL cells. Data are normalized 

to total H3 (K4me3, n= 2; K27me3, n=5; K36me3, n=5, independent 
replicates). For gel source images see Supplementary Data Fig. 1. i, Dot 
blot of DNA isolated from K or KL cells probed with anti-ShmC antibody. 
Quantified signal was normalized to total DNA as measured by methylene 
blue staining (n = 4, independent replicates). For all panels, error bars 
show s.d. unless otherwise stated; *P < 0.05, **P< 0.01, ***P< 0.001. 
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Extended Data Figure 6 | Serine pathway activity sustains DNA 
methylation in KL cells. a, Dot blot of DNA isolated from K or KL cells 
transduced with shControl, shPSAT1-1 or shPSAT1-2 probed with anti- 
5mC antibody. Total DNA was visualized by methylene blue staining. 
Graph shows quantified signal normalized to total DNA as measured 

by methylene blue staining (K cells, n = 4; KL cells, n = 8, independent 
replicates). b, Dot blot of DNA isolated from KPC cells transduced with 
shControl, shPSAT1-1 or shPSAT1-2 probed with anti-5mC antibody. 
The graph shows quantified signal normalized to total DNA (n=4, 
independent replicates). c, Dot blot of DNA probed with anti-5mC 
antibody. DNA was isolated from KL cells first transduced with vector or 
human PSAT1 then transduced with shControl, shPSAT1-1 or shPSAT-2. 
Graph shows quantified signal normalized to total DNA as measured by 


methylene blue staining (n = 3, independent replicates). d, Expression 

of serine pathway genes and DNMT genes in WT, K, L or KL cells by 
qRT-PCR. Data are normalized to 18S and expressed as relative to K 
cells (n= 6, pooled data from two experiments). e, Plots of isotopomer 
abundance of U[*C]glucose-derived M+3 serine and M+2 glycine, 6h 
after addition of U[!?C] glucose (n= 6 independent biological replicates). 
f, Quantified DNA dot blot signal of DNA isolated from WT, K, L or 

KL cells probed with anti-5mC antibody normalized to total DNA as 
measured by methylene blue staining (n = 4, independent replicates). 

g, Five-day growth of WT, K, L or KL cells transduced with shControl or 
two shRNAs against PSAT1. Data are expressed as relative to day 0 (n=4, 
pooled from two experiments). For all panels, error bars show s.d. unless 
otherwise stated; *P< 0.05, **P< 0.01, ***P< 0.001. 
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Extended Data Figure 7 | See next page for cation. 
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Extended Data Figure 7 | LKB1-mediated regulation of glycolysis- 
SGOC-DMMT pathway involves the AMKP-mTOR axis. a-e, KL cells 
were transduced with vector, wild-type LKB1 or kinase-dead LKB1. 

a, Proliferation, expressed as relative to day 0 (n= 4). b, Glucose uptake 
measured using 2NBDG followed by fluorimetry (data normalized to cell 
number and expressed as relative to KL-vector cells (n = 8)). ¢, Lactate 
levels measured by fluorimetry 3 h after medium change, normalized to 
cell number (n=8). d, Oxygen consumption rates measured in normal 
duct medium, followed by injections with 441M oligomycin (O), 41M 
FCCP (F), or 44.M antimycin A (A) (n = 3). e, Expression of the indicated 
genes measured by qRT-PCR (n= 4), with levels normalized to 18S 
rRNA. Data are expressed as relative to KL-empty vector (EV) cells. 

f, Immunoblot of K, KL or KL cells transduced with LKB1 cDNA (rescue). 
For gel source images see Supplementary Data Fig. 1. g, Immunoblot 

of K cells treated overnight with 101M Compound C. For gel source 
images see Supplementary Data Fig. 1. h, i, Glucose uptake (h) and 3h 
lactate production (i) in the same cells as in h. Data are normalized to cell 
number and expressed as relative to control K cells (glucose uptake, n = 3; 
lactate, n = 4). j, Glucose uptake in K or KL cells treated with vehicle or 
104M Compound C. Data are normalized to cell number and expressed as 
relative to control K cells (n = 4). Note the blunted response of KL cells. 

k, qRT-PCR analysis of the indicated genes in the same cells as in h. Levels 
are normalized to 18S rRNA. Data are expressed as relative to control 

K cells (n =6). 1, Immunostaining for 5-mC (left) and quantification of 
staining (right) in K cells treated with vehicle or Compound C for 4 days 
(158-163 cells). m-q, K cells were transduced with shControl or shRNAs 
against AMPKal or AMPKa2. m, Glucose uptake. Data are normalized to 
cell number and expressed as relative to shControl-treated cells (n = 4). 
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n, Steady-state ATP levels measured with CellTiterGlo (Promega), 
normalized to cell number and expressed as relative to ATP levels in 
shControl-treated cells (n= 4). 0, Lactate levels measured by fluorimetry 

3 h after medium change. Data are normalized to cell number (n= 3). 

p, Immunostaining for 5-mC (left) and quantification of staining (right) 
(159-296 cells). q, Expression of the indicated genes determined by 
qRT-PCR. Levels are normalized to 18S rRNA. Data are expressed as 
relative to shControl cells (1 =4). r-u, KL cells were transduced with 
vector or an LKB1 cDNA and shControl or shRNAs against AMPKal 

or AMPKa2. r, Glucose uptake. Data are normalized to cell number and 
expressed as relative to vector-shControl cells (n =4). s, Steady-state ATP 
levels normalized to cell number and expressed as relative to ATP levels in 
EV-shControl cells (n= 4). t, Lactate levels 3 h after medium change. Data 
are normalized to cell number (n= 4). u, Proliferation expressed as relative 
to day 0 (n= 3). v, Impact of Torin 1 treatment on growth of K and KL cells 
(n=4). w, Immunoblot of K and KL cells treated with vehicle or 25nM 
Torin 1. For gel source images see Supplementary Data Fig. 1. x, Glucose 
uptake and 3h lactate production in K or KL cells treated overnight with 
25nM Torin 1. Data are normalized to cell number (n= 4). y, Isotopomer 
abundance of U[!?C]glucose-derived serine and glycine in the same 

cells as in r (n =3 independent replicates). Cells were labelled with 
U[?C]glucose for 6 h. z, Expression of the indicated genes in Torin-treated 
K (left) or KL cells (right) determined by qRT-PCR. Levels are normalized 
to 18S rRNA. Data are expressed as relative to control-treated cells (n= 4). 
Data are pooled from two (b, ¢, e, i-n, p-t, v, x, Z) or representative of two 
(a, d, h, 0) or three (u) experiments. Error bars show s.e.m. in d, s.d. in all 
other panels. *P < 0.05, **P < 0.01, ***P<0.001. 
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Extended Data Figure 8 | See next page for caption. 
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Extended Data Figure 8 | Loss of LKB1 increases DNA methylation 

in retrotransposon elements. a, Hierarchical clustering of K and KL 
samples based on methylation levels at 100-bp autosomal genomic tiles 
as measured by whole-genome bisulfite sequencing. b, Two-dimensional 
density plot for methylation at non-repetitive 100-bp autosomal genomic 
tiles in KL versus K cells. The yellow line shows the mean methylation of 
tiles. The dots represent differentially methylated tiles (FDR q-value 

< 0.05, methylation change > 0.1). There are 3,395 hypermethylated tiles 
and 1,270 hypomethylated tiles in KL cells versus K cells. c, Overlap of 
genes associated with differentially methylated regions and differentially 
regulated genes (hyper DMRs, hypermethylated regions; hypo DMRs, 
hypomethylated regions; DRGs, differentially regulated genes). 
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d-j, Distribution of methylation density within the low-CpG-promoters 
(LCPs), high-CpG-promoters (HCPs), islands, shores, exons, introns and 
LTRs. Numbers reflect median values. k, Average percentile change in 
methylation in the same elements as in d-j as well as LINEs and LTRs. 

1, Number of genes containing retrotransposon repeat elements (sum 

of LINEs, SINEs, LTRs) among the set of differentially regulated genes 
(DRGs) in K versus KL cells and among all genes. Note that the DRGs are 
enriched for the presence of retrotransposons in their gene bodies (top), 
but not in their promoters (middle) or 3’UTRs (bottom). m, Specific 
enrichment of LINE, SINE and LTR elements in the gene bodies of DRGs 
when compared to all genes in the genome. 
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Extended Data Figure 9 | See next page for caption. 
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Extended Data Figure 9 | LKB1 deficiency confers hypersensitivity to 
inhibitors of DNA methylation. a-c, Proliferation of K (a), KL (b) and 
KPC (c) cells transduced with shControl or two shRNAs against each of 
DNMT1 (D1) or DNMT3A (D3A). Data are expressed as relative to day 

0 (Kand KLn=6, KPC n=4). d-f, Volume of subcutaneous tumours 
derived from KPC cells transduced with doxcycline (Dox)-inducible 
shRNAs against DNMT1 or DNMT3A (n= 4). Doxcycline was introduced 
to the drinking water when tumours reached 125 mm*. Error bars show 
s.e.m. For source data on tumour volume, see Supplementary Data Table 4. 
g, Apoptosis measured by caspase 3/7 activity in K or KL cells treated 

with decitabine for 48 h. Values are normalized to cell number (n= 3). 

h, i, Proliferation of KL cells transduced with empty vector (h) or LKB1 
cDNA (i) treated with decitabine. Data are expressed as relative to day 0 
(n= 3). j, Proliferation of KL cells transduced with empty vector or LKB1 
cDNA treated with RG108. Data are expressed as percentage of growth 

of untreated cells (n= 6). k-m, Proliferation of K or KL cells treated with 
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RG108 (n= 6) (k), EGCG (n= 12) (1) or SGI1027 (n= 12) (m). Data are 
expressed as percentage of growth of untreated cells. n—q, Mice bearing 
subcutaneous KPC tumours were treated with decitabine (n = 12) or 
vehicle (n = 12) when tumours reached 125 mm?. n, 0, Tumour volume (n) 
and final tumour weight (0). Error bars show s.e.m. For source data on 
tumour volume, see Supplementary Data Table 4. p, Haematoxylin and 
eosin-stained slides from representative tumours (top). Bottom, anti-CK19 
(green) was used to visualize the neoplastic epithelium and anti-PCNA 
(red) was used to mark proliferating cells. DAPI was used to stain nuclei 
(blue). q, Quantification of the CK19* neoplastic epithelial compartment 
(%CK19* cells/total cells; top). Quantification of CK19* cells with nuclear 
PCNA staining (bottom; n =6). Scale bars, 100 1m. Insets are threefold 
magnification. Data pooled from two (a-c, j) or four (k-m) experiments 
or representative of two (h) or three (g) experiments. Error bars show s.d. 
unless otherwise stated; *P < 0.05, **P< 0.01, ***P< 0.001. 
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Extended Data Figure 10 | See next page for caption. 
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Extended Data Figure 10 | Vulnerabilities of human LKB1 mutant 
pancreatic cancer cell lines. a, Three-day growth of LKB1 wild-type 
(black) or LKB1 mutant (red) human pancreatic cancer cells. Data are 
expressed as relative to shControl-transduced cells, which is arbitrarily 
set to 100 (n= 3). b, Quantification of tumour volume of subcutaneous 
tumours derived from implantation of the indicated cells transduced with 
shControl or two shRNAs against PSAT1 (n= 4). Error bars show s.e.m. 
For source data on tumour volume, see Supplementary Data Table 4. 

c, Immunofluorescence staining and quantification of 5mC in COLO357, 
PANCI1 and PATU-8988T cells transduced with shControl or two shRNAs 
against PSAT1 (607-760 cells for COLO357, 305-342 cells for PANC1 and 
623-889 cells for PATU-8988T). Scale bar, 25 \1m. Error bars show s.e.m. 
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Data are expressed as fluorescence per nucleus. d, Detection of 5mC by 
immunofluorescence in COLO357 (LKB1-deficient) cells transduced with 
vector or wild-type LKB1. Quantification is presented as fluorescence 

per nucleus (438-618 cells). Scale bar, 25 zm. Error bars show s.e.m. 

e, Three-day growth of LKB1 wild-type (black) or LKB1 mutant (red) 
human pancreatic cancer cells treated with 101M 3-deazaadenosine 
(3DZA) (top) or 2mM cycloleucine (CycloLeu) (bottom). f, Metabolic and 
epigenetic changes promoting transformation upon deletion of the tumour 
suppressor LKB1. Data pooled from (c, d) or representative of (a) two 
experiments. Error bars show s.d. unless otherwise stated; *P < 0.05, 

**EP < 0.01, ***P < 0.001. 
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doi:10.1038/nature20152 


Thermophilic archaea activate butane 
via alkyl-coenzyme M formation 


Rafael Laso-Pérez!*, Gunter Wegener)”, Katrin Knittel', Friedrich Widdel!, Katie J. Harding's, Viola Krukenberg'”, 
Dimitri V. Meier!, Michael Richter!, Halina E. Tegetmeyer?*, Dietmar Riedel’, Hans-Hermann Richnow’, LorenzAdrian®, 
Thorsten Reemtsma’®, Oliver Lechtenfeld® & Florin Musat!® 


The anaerobic formation and oxidation of methane involve unique enzymatic mechanisms and cofactors that are believed 
to be all specific for C,-compounds. Here we found that an anaerobic thermophilic enrichment culture composed 
of dense consortia of archaea and bacteria apparently uses partly similar pathways to oxidize the C,-hydrocarbon 
butane. The archaea, proposed genus Candidatus Syntrophoarchaeum, showed the characteristic autofluorescence 
of methanogens, and contained highly expressed genes encoding enzymes similar to methyl- coenzyme M reductase 
(MCR). We detected butyl-coenzyme M, indicating archaeal butane activation in analogy to the first step in anaerobic 
methane oxidation. In addition, Ca. Syntrophoarchaeum expressed the genes encoding beta-oxidation enzymes, carbon 
monoxide dehydrogenase and reversible C;-methanogenesis enzymes. This allows complete oxidation of butane. Reducing 
equivalents are apparently channelled to HotSeep-1, a thermophilic sulfate-reducing partner bacterium known from the 
anaerobic oxidation of methane. Genes encoding 16S rRNA and MCR similar to those identifying Ca. Syntrophoarchaeum 
were repeatedly retrieved from marine subsurface sediments suggesting that the presented activation mechanism is 


naturally widespread in the anaerobic oxidation of short-chain hydrocarbons. 


Etymology. Syntrophoarchaeum. syn (greek): together, tropho (greek) 
meaning nourishment, archaeum (greek): ancient; butanivorans} 
butanum (latin): butane, vorans (latin): eating, devouring. The name 
implies an organism capable of butane oxidation, howeyerdemanding 
syntrophic electron sinks. A second strain is named Ca. S. caldarius, 
caldarius (latin): warm/ hot, refers to its thermophilic growth condition. 
Locality. Enriched from hydrothermally-heated, hydroearbon-rich 
marine sediments of the Guaymas Basin at(2000m water depth, Gulf 
of California, Mexico. 

Diagnosis. Anaerobic, butane-oxidizing archaeon, variable morphol- 
ogy, 1.5 x 1 um, dependent on syntrophiesupport by the sulfate-reducing 
partner bacterium Candidatus Desulfofervidus auxilii. 

Massive amounts of natural gas migrate from deep-seated reser- 
voirs towards the seafloor!?. Mostiof this gas is already consumed 
in the anoxic zone byymicroorganisms coupling oxidation of the 
hydrocarbons to the reduction of the abundant electron acceptor, 
sulfate. Research on the anaerobic oxidation of natural gas has been 
largely focused on methane as the most abundant constituent”. The 
anaerobic oxidation of methane (AOM) is carried out by anaerobic 
methanotrophic archaea (ANME) forming consortia with sulfate- 
reducing bacteria (SRB); ANME activate methane to methyl- 
coenzyme M by methyl-CoM reductases (MCR)*°. The CoM-bound 
methyl group is further oxidized to CO) by reversing the enzymatic 
chain of methanogenesis®”’, while reducing equivalents are channelled 
to the partner SRB®”. 

However, natural gas generated by thermogenic decomposition 
of organic matter contains considerable amounts (up to 20%) short- 
chain alkanes including mostly ethane (up to 15%), but also propane, 
iso-butane and n-butane (hereafter butane)!™"'. Although these gases 
are potential growth substrates for microorganisms, the anaerobic 


oxidation of the non-methane hydrocarbons became of interest only 
recently. All cultures known so far to oxidize short-chain hydrocar- 
bons anaerobically are bacteria that couple complete substrate oxidiza- 
tion to CO? and sulfate reduction in one organism*"'®. The only pure 
culture available so far is strain BuS5, a deltaproteobacterium”. Strain 
BuS5 and related organisms activate propane and butane via addition 
to fumarate yielding (1-methylalkyl)succinate, to date the most-studied 
anaerobic activation mechanism for hydrocarbons'’”!”. A recent 
study, however, suggests that this mechanism is not always involved in 
short-chain hydrocarbon oxidation’. Here we combined physiological 
experiments, community sequencing, microscopy, omics approaches 
and metabolite analyses to provide evidence for an alternative anaerobic 
activation reaction of butane in a highly enriched thermophilic culture 
of archaeal-bacterial consortia. 


Archaea catalyse thermophilic butane oxidation 

Thermophilic anaerobic methanotrophic archaea (ANME-1) were pre- 
viously found to be associated with a deltaproteobacterial phylotype, 
HotSeep-1°! that had been detected independently in thermophilic 
enrichment cultures with short-chain alkanes!*'°. This prompted us 
to attempt again thermophilic enrichments with a short-chain hydro- 
carbon, butane; for these, we used inocula from Guaymas Basin sed- 
iment samples that had already been incubated with methane and 
developed thermophilic AOM activity. Indeed, after two months, also 
butane-dependent sulfate reduction became obvious. Further culti- 
vation and subsequent transfers to new medium with butane yielded 
a sediment-free culture, hereafter called Butane50. This culture 
reduced 10 mM sulfate within approximately 14 weeks of incubation. 
Quantitative growth experiments showed that the amount of butane 
degraded was apparently coupled to the stoichiometric reduction of 
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sulfate to sulfide in a 1:3.25 ratio, as demonstrated also for strain BuS5 
(Figure 1a; Supplementary Table 1), yielding the net reaction 


4C 4Hy) + 1380427 + 26H* > 16CO, + 13H,S + 20H,O 


In this culture the ANME-1 phylotype was no longer detectable by 
molecular analysis. Instead, archaea closely related to Methanosarcinales 
had become dominant due to the change to butane as substrate; they 
were members of the GoM-Arch87 clade” (Extended Data Table 1). Like 
ANME-1, also GoM-Arch87 forms densely packed consortia with the 
HotSeep-1 partner bacterium (Figure 1b,c). These consortia show strong 
blue-green autofluorescence (maximum at approx. 475 nm; Figure 1d), 
which is characteristic of cofactor F499, a hydrogen carrier typical for 
methanogenic and ANME archaea®”?. Phylotype GoM-Arch87 was 
originally detected in cold-seep areas of the Gulf of Mexico”, and since 
then was also found in other marine environments, especially seep and 
vent areas (see SILVA database release SSU 119; www.arb-silva.de). The 
environmental role of these archaea has been so far unknown. 

From the metagenome assembly we retrieved two bins of contigs 
indicating genomes of two different GoM-Arch87 (Supplementary 
Table 2 and Extended Data Table 2). The two GoM-Arch87 bins have 
sizes of 1.46 Mbps (GoM-Arch87-1) and 1.66 Mbps (GoM-Arch87-2). 
Based on their tRNA content, archaeal specific single copy gene num- 
bers and lineage-specific marker gene sets (Euryarchaeota) the draft 
genomes cover an estimated 85-89% of the complete genome of GoM- 
Arch87-1 and 95-97% of GoM-Arch87-2, indicating genome sizes 
of 1.64 to 1.71 Mbps. According to their 16S rRNA gene identity of 
96% and whole genome identity of 74-90% (Figure le; Extended Data 
Table 2), the two bins likely represent different species of one genus. 
Hence we propose to name the more abundant (according to genome 
coverage) strain GoM-Arch87-1, Candidatus Syntrophoarchaeum 
butanivorans and the less abundant strain GoM-Arch87-2, Candidatus 
Syntrophoarchaeum caldarius. The two strains have largely similarngene 
content, and are further discussed together as Ca. Syntrophoarchaeum, 

Ca. Syntrophoarchaeum encodes a fatty acid oxidation pathway, 
two complete acetyl-CoA decarbonylase/synthase:CO dehydrogenase 
(ACDS/CODH) complexes, and an almost complete methanogenesis- 
related pathway. Each of the two draft genomes contains four complete 
mcr gene sets, of which three (Ca. S. butanivorans) or all four (Ca. S. cal- 
darius) are organized in operons (Extended Data Figure 1). The correct 
assembly and the unusually high number of mcr genes in a single 
genome were confirmed by cloning and resequencing each of the mcrA 
genes. Metatranscriptome and metaproteome analyses (only studied on 
Ca. S. butanivorans) showed thatall thesegene/protein sets were highly 
expressed/abundant (Extended Data Tables 3-5). Neither the metagen- 
omic assembly nor the draft'genomes of Ca. Syntrophoarchaeum 
contain genes encoding the otherwise wide-spread glycyl radical 
enzymes (assA/masD, bssA) for,anaerobic hydrocarbon activation. 
Accordingly, these genes could not be amplified from extracted DNA 
using specific primers (see,methods). These results strongly indicated 
that the activation of butane in Ca. Syntrophoarchaeum proceeds via 
mechanismsidifferent than those described for bacteria. An appealing 
hypothesis was\that butane in Ca. Syntrophoarchaeum is activated by 
the distinct variants of MCR in a reaction analogous to that of methane 
at“reverse MCR in ANME. The expected first metabolite would thus 
be butyl-coenzyme M rather than methyl-coenzyme M. 


Ca. Syntrophoarchaeum forms butyl-coenzyme M 

Analysis of cell extracts of active Butane50 cultures by direct infusion 
ultra-high resolution mass spectrometry indeed provided direct evi- 
dence for butane-dependent formation of butyl-coenzyme M. A mass 
peak of m/z= 197.03116 was detected, which corresponded exactly to 
the hypothesized butyl-CoM (C¢Hj3S203—). Following quadrupole iso- 
lation and collision-induced fragmentation this compound yielded two 
major fragments representing a butylthiol (m/z= 89.0430, CyHS) and 
an apparently CoM-derived bisulfite (m/z = 80.9652, HSO3; ; Figure 2 
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and Extended Data Figure 2a). Use of synthesized authentic standards 
showed that the two isomers, 1-butyl-CoM and 2-butyl-CoM, were 
present and separated by liquid chromatography, with apparently 
higher amounts of the latter (Figure 2c). The presence of two butyl- 
CoM isomers offers, in principle, two explanations. First, one could 
be the initial product whereas the other is the subsequently formed 
intermediate. Second, only one isomer may be the genuine, directly 
metabolized activation product, whereas the other is formed as a 
by-product due to relaxed catalytic specificity of the activating enzyme; 
this would be similar to the anaerobic activation of propane by strain 
BuS5, where both n- and iso-propylsuccinate have been detected as 
activation products!**. Because the different amounts of the detected 
isomers reflect steady state concentrations (dynamic pools), they can- 
not offer clues to their metabolic significances To further corroborate 
butyl-CoM formation as a specific activation reaction, ive. exclude that 
it can be accomplished ‘accidentally’ by other means; we added butane 
to a methane-oxidizing ANME-1 enrichment. Neither butyl-CoM nor 
its derived fragments were detected. Moreover, these compounds were 
not detected in sterile controls, in substrate-starved Butane50 cultures, 
or in cultures of the bacterial butane-oxidizer strain BuS5 (Figure 2 and 
Extended Data Figure 2a). 

Mechanistic relatedness between butane activation at the MCR var- 
iant and methane activation atMCR of methanotrophic archaea was 
also shown by an inhibition experiment. The coenzyme M analogue, 
bromoethanesulfonate (BES), which inhibits methanogenesis as well as 
anaerobic methane oxidation” also inhibited butane-dependent sulfate 
reduction(Extended Data Figure 3). Inhibition must occur in the 
archaeal metabolism, because BES did not affect the partner bacterium, 
HotSeep-1 (Ca. Desulfofervidus auxilii) when separately grown with 
hydrogen and sulfate; the tested partner originated from the thermo- 
philic AOM culture”®. 

Aliquots of the Butane50 culture were also tested with other hydro- 
carbons. After approx. two months of incubation with propane, sulfate 
reduction became obvious. The rate increased and became similar as 
with butane. A possible shift in the microbial composition or gene 
expression upon incubation with propane has yet not been assessed. 
However, a mass peak exactly corresponding to that of propyl-CoM 
was analysed (m/z= 183.0155; Extended Data Figure 2c). This further 
corroborates the formation of alkyl-CoM as the apparent initial product 
in short-chain alkane activation. So far, no sulfate reduction was detect- 
able with added methane, ethane, iso-butane, n-pentane or n-hexane. 

This is the first demonstration of coenzyme M acting in vivo as 
a carrier of alkyl moieties other than the methyl group. The closest 
known precedent is the in vitro reduction of the homologous sub- 
strate ethyl-CoM to ethane by MCR from methanogenic archaea”””®. 
Turnover rates of ethyl-CoM in these assays were two orders lower 
than of methyl-CoM, which was explained by a steric hindrance of the 
larger ethyl-CoM molecule in the MCR enzyme”. This suggests that an 
efficient activation of butane to butyl-CoM may require highly adapted 
MCR-like enzymes. Analysis of the deduced amino acid sequences of 
the eight mcr genes detected in Ca. Syntrophoarchaeum showed a low 
similarity among them with an identity of only 29-82%. Two different 
MCR enzymes encoded in the genome of Ca. S. butanivorans were 
highly abundant in protein extracts of the culture, qualifying them as 
likely candidates for butane activation (Extended Data Table 3). 

A high substrate specificity of the MCR type apparently involved in 
butane oxidation was also indicated in an incubation experiment with 
both, butane and methane. In the active culture, butyl-CoM but no 
methyl-CoM was detected (Extended Data Figure 2b). Hence, there 
was no evidence for co-activation of methane, even though alone by 
its molecular size the latter should not encounter any steric binding 
hindrance. One may speculate that reaction at the enzyme requires 
substrate ‘fixation’ by binding of an alkyl chain of a minimum length. 
Understanding as to which extent the slightly weaker C—H-bond in 
higher alkanes in comparison to that in methane could play a role may 
need refined theoretical consideration. Furthermore, the population 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


and abundant MCR type in the recent propane-adapted subculture 
will have to be analysed. At the present state it is unknown whether 
propane and butane are activated by the same organism and enzyme 
or cause selection of different ones with respective substrate preference. 

Notably three of the McrA subunits of Ca. S. butanivorans have a 
related equivalent in Ca. S. caldarius. Interestingly the fourth McrA sub- 
unit (SBU_000314) forms a cluster with McrA sequences of the recently 
described Bathyarchaeota”, whereas the fourth sequence of Ca. S. cal- 
darius (SCAL_000352) has a distant phylogenetic position (Figure 3; 
for further phylogenetic analysis see Methods and Supplementary 
Figure 1). Together McrA sequences of Ca. Syntrophoarchaeum and 
Bathyarchaeota form a highly divergent cluster compared to those of 
methanogens or methanotrophs, which is likely due to the ability of the 
encoded enzymes to efficiently accommodate larger substrates such 
as butane. 


Complete oxidation of butane 

The measured stoichiometric balance shows that the CoM-bound 
butyl moiety must be completely degraded to CO>. Only a minor frac- 
tion is expected to enter biosynthesis, like in other strict anaerobes*”. 
Complete oxidation requires four basic metabolic features, the conver- 
sion of the buty]-thioether to the butyryl-thioester (presumably butyryl- 
CoA), oxidation of butyryl-CoA to acetyl-CoA, terminal oxidation 
of acetyl-CoA to CO;, and channelling of the reducing equivalents 
(electrons) into sulfate reduction. Our proteogenomic analysis provides 
explanations for all of these processes, with the exception of the first. 

The role and significance of the butyl-CoM isomers and their subse- 
quent processing steps could not be clarified on the basis of our pres- 
ent analyses. Also, there are no precedent cases of natural pathways 
involving non-methyl CoM-thioethers that could suggest particular 
reactions. Still, one may argue that 1-butyl-CoM is the genuine and 
directly metabolized activation product. Because MCR in AOM acti- 
vates a primary C—H-bond, its variant in culture Butane50 may also 
attack butane at the primary carbon. In this way, first oxidationand 
functionalization would occur at the same carbon atom which in a 
subsequent beta-oxidation would become the activated carboxyl group 
(thioester). In such case, 2-butyl-CoM would bea by-product; it is 
presently unknown whether an isomerase converts it,to 1=butyl-CoM. 

In methyl-CoM oxidation in AOM and,inithe oxidative branch 
in methanol-grown Methanosarcina, thesubsequent steps would be 
transfer of the methyl group to tetrahydromethanopterin (H4MPT) or 
tetrahydrosarcinapterin (H,SPT) and.oxidation via the bound formal- 
dehyde and formate states finally to CO. None of the genes encoding 
methyl-H,MPT:coenzyme M methyltransferase were found in the 
genome. An analogous transfer of the butyl-moiety by the abundant, 
presumably modified methyltransferase (Mta) may be only specu- 
lated about (Figure 4, Extended Data Table 3), because the postulated 
butyl-cobalamin intermediate;would be unprecedented. Furthermore, 
tetrahydropterins and methanofuran are, to our knowledge, exclusive 
C,-carriers. Hence; the enzymatic reactions converting 1-butyl-CoM 
to the butyryl (bound acid) level are presently puzzling, even though 
transfer.andyoxidation reactions of higher carbon compounds can be 
theoretically formulated also with the C;-carriers. 

The predicted beta oxidation enzymes of Ca. Syntrophoarchaeum 
(Figure.4; Extended Data Table 3) are highly related to those of 
deltaproteobacterial sulfate reducers and syntrophic partners of 
methanogens*!*” (Extended Data Table 6). This suggests horizon- 
tal gene transfer across the two domains of life. The gene encod- 
ing the most abundantly formed acyl-CoA dehydrogenase of Ca. 
Syntrophoarchaeum is located in an operon with genes for an electron 
transfer flavoprotein (etf) complex and a [FeS]-oxidoreductase. Hence, 
these gene products likely act as electron acceptors in the oxidation 
of butyryl-CoA (Figure 4) as previously shown for bacterial butyrate 
oxidizers*3**, 

The key enzyme for the metabolism of acetyl-CoA from beta- 
oxidation is the detected acetyl-CoA decarbonylase/synthase:CO 
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dehydrogenase (ACDS/CODH). The carboxyl group-derived bound 
CO can be oxidized to free CO? and yield reduced ferredoxin as shown 
before for methanogens”’. The methyl group is probably oxidized 
via the reverse methanogenesis pathway, of which almost all genes 
are present and their corresponding transcripts and proteins were 
detected (Figure 4, Extended Data Table 4). The N5,N10-methylene- 
tetrahydromethanopterin (methylene-H4MPT) reductase (mer)gene 
is absent in Ca. Syntrophoarchaeum. However, like in ANME-1 Mer 
could be substituted by the highly abundant putative methylenetet- 
rahydrofolate reductase (Met) complex as suggested before*’. The 
genes encoding the Met complex of Ca. Syntrophoarchaeum have 
a similar operon structure to those of Moorella.thermoacetica*’. In 
conclusion, the combination of enzymes in Ca. Syntrophoarchaeum 
results in the archaeal version of the oxidative’ Wood-Ljungdahl 
pathway, as originally shown for the sulfate-reducing Archaeoglobus 
fulgidus*’. 

Ca. Syntrophoarchaeum does not have'the genes for canonical sul- 
fate reduction (i.e. dsrAB, aprAB), and hence depends on an external 
electron sink. This role is apparentlyfulfilled by the partner bacterium. 
We tested for the production of hydrogen as a canonical intermediate in 
syntrophic associations.Although Ca. Syntrophoarchaeum encodes for 
a cytoplasmic [NiFe] hydrogenase (Extended Data Table 5), only minor 
amounts of hydrogen were produced in active or molybdate-treated 
Butane50 cultures, which cannot explain the required reducing equiva- 
lent transfer(Figure 5a).To further study how reducing equivalents are 
transferred to the partner bacterium, the draft genome of the Butane50 
HotSeep=1 strain was retrieved from the metagenome. Based on differ- 
entmarker genes, this HotSeep-1 strain is basically identical to Ca. D. 
auxiliij the partner in thermophilic AOM”® (Supplementary Table 3). 
Ca. D. auxilii is a lithoautotrophic sulfate reducer that in thermophilic 
AOM thrives on electrons directly supplied by the ANME via pili-based 
nanowires and cytochromes””®. Also in the Butane50 culture HotSeep-1 
expresses genes encoding pili assembly proteins, especially the main 
component of type IV pilus (PilA), and many different potentially 
cytochromes including extracellular ones (Extended Data Table 7). 
Indeed, using transmission electron microscopy we found a similarly 
dense, apparently pili-based, nanowire network in the intercellular 
space of the Butane50 consortia (Figure 5b). Based on these results 
we propose that nanowire-based direct interspecies electron trans- 
fer*® mediates also electron exchange in the butane-oxidizing consor- 
tia. From a thermodynamic point of view, the interspecies transfer of 
reducing equivalents would be also possible via H2; its measured partial 
pressure of ca. 1 Pa would allow energy conservation in both partners, 
albeit with quite unequal energy sharing (calculation in Supplementary 
Discussion; Supplementary Figure 2). However, the extremely slow rate 
of H, formation (Figure 5a) is difficult to reconcile with the observed 
rate of sulfate reduction. 

Channelling of the reducing equivalents to the sulfate-reducing 
partner bacterium requires electrons with redox potentials below 
the average potential of sulfate reduction (E° = —220 mV), which is 
reached by most proposed redox reactions. However, the oxidation 
of butyryl-CoA to crotonyl-CoA (Figure 4), will release electrons 
with far higher potentials (E”’= —125/—10mV°*?). A shift in the 
redox potential of these electrons can be achieved by different mech- 
anisms, including energy-driven reverse electron transport*? or 
electron confurcation*’, both demonstrated for syntrophic bacteria. 
Two of the required genes, encoding EtfAB and FeS oxidoreductase, 
were identified in Ca. Syntrophoarchaeum. Other genes with prod- 
ucts involved in electron transport have been identified including 
different heterodisulfide reductases (Extended Data Table 5) like 
one complex of HdrABC (SBUT_000297-299) followed by the 
genes encoding the 3-subunit of formate dehydrogenase (FdhB) 
and the 6-subunit of methylviologen-reducing hydrogenase (MvhD) 
(Extended Data Table 5). This complex was already identified in 
ANME-1 genomes and was hypothesised to be involved as electron- 


accepting complex’. 
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Evolutionary and environmental aspects 

The analysis of the anaerobic consortia enriched with butane provides 
first insights into an additional mechanism for the oxygen-independent 
activation and use of non-methane saturated hydrocarbons in archaea. 
Saturated hydrocarbons belong to the chemically least reactive com- 
pounds. In anaerobic bacteria, the radical-catalysed addition of alkanes 
to fumarate yielding substituted succinates is regarded as common*™"', 
even though alternative mechanisms have been proposed*”"?. The 
MCR-like proteins produced by Ca. Syntrophoarchaeum and forma- 
tion of butyl-coenzyme M in the culture suggest that the mechanistic 
principle for the activation of methane, the most stable hydrocarbon, 
was evolutionarily adapted to use also short chain alkanes. Another 
requirement for metabolic processing was, apart from the yet unsolved 
subsequent step of the alkyl-thioether, the acquisition of enzymes for 
beta-oxidation, which has been shown before in other archaea**. The 
subsequent reactions leading to CO) are enabled by steps which in prin- 
ciple are long-known in methanogens; acetate- and methanol-utilizing 
species contain CO-dehydrogenase/acetyl-CoA synthase as well as C,- 
processing enzymes able to operate in oxidative direction*’. Such an 
archaeal variant of the Wood-Ljungdahl pathway for acetyl-CoA oxida- 
tion was first described for the sulfate reducer Archaeoglobus fulgidus’. 

The presently suggested pathway for anaerobic oxidation of 
non-methane alkanes might not be limited to Ca. Syntrophoarchaeum. 
The recently detected uncultivated Bathyarchaeota contain genes for 
MCR types highly similar to those of Ca. Syntrophoarchaeum (Figure 3). 
Furthermore, the draft genomes of Bathyarchaeota”? as well as the 
pan-genome of uncultivated Hadesarchaea** encode most enzymes of 
the methanogenesis pathway, for short-chain fatty acid oxidation and 
for acetyl-CoA oxidation via ACDS/CODH. Based on these predicted 
features, Bathyarchaeota were described as heterotrophic methanogens. 
Our results indicate that at least some of these uncultivated archaea may 
be also viewed as alkane oxidizers. 

In conclusion, anaerobic microorganisms thriving on non-methane 
alkanes may be phylogenetically and metabolically even more diverse 
than hitherto thought*”“®. Such diversity may reflect a long-existing 
presence of saturated hydrocarbons in the biosphere and their exploita- 
tion as growth substrates through various evolving pathways, long 
before the oxygen-dependent ones. The factors which inmodern 
anoxic environments select for consortial, archaeal hydrocarbon oxida- 
tion over an oxidation by single bacterial spécies!*“are’still unknown 
and need more comprehensive in situ studies. 


Online Content Methods, along with any additionahExtended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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Figure 1 | Characterization of the Butane50 culture. a, The Butane50 light at 405 nm and a longpass emission filter (>463 nm). The 
culture consumed butane (red squares) coupled with stoichiometric autofluorescence maximum at 470 nm is indicative of the presence of 
reduction of sulfate to sulfide (black circles); no sulfide was produced the cofactor F4.9; representative for 10 recorded images, scale bar = 101m. 
in cultures without butane (white circles), error bar = standard e, Phylogenetic affiliation of the 16S rRNA gene sequences from the 
deviation, n= 3. b,c, Fluorescence micrographs of Butane50\consortia studied Ca. Syntrophoarchaeum strains (in red) within the Euryarchaeota. 
stained with specific probes for GoM-Arch87 (red) and HotSeep-1 The GoM-Arch87 cluster is indicated with blue background; bar = 10% 
(green); representative for 20 recorded images, scale bar= 10 ym. estimated sequence divergence; bootstraps values >80% and >90% are 
d, Autofluorescence of microbial consortia visualized using excitation indicated by grey and black circles, respectively. 
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Figure 2 | Butyl-CoM as initial metabolic intermediate in butane 
oxidation a, Full scan mass spectrum of Butane50 culture extracts 
(Butane50+C4; n= 6 at different time points) revealed a peak at 

m/z = 197.0312 which matches the butyl-CoM standard (mass accuracy 
+0.18 ppm). This mass peak was absent in control incubations without 
butane (Butane50-C4; for more controls see Extended Data Figure 2). 

b, Isolation and collision induced fragmentation of this mass peak yielded 
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butylthiol (mass accuracy +0.01 ppm) and bisulfite (mass accuracy 
-+-0.03 ppm), which are likewise produced by fragmentation of the butyl- 
CoM standards. c, Liquid chromatography resolved the presence of the 
twoOlisomers 1- and 2-butyl-CoM in the culture (n = 4 at different sulfide 
concentrations). d, Interpretation of these analyses. Butane is activated 
by ligation to coenzyme M by MCR-like enzymes, yielding both 1- and 
2-butyl-CoM with identical fragmentation patterns. 
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Figure 3 | Phylogenetic affiliation of McrA amino acid sequences 
present in Ca. S. butanivorans and Ca. S. caldarius. The 
phylogenetic tree was constructed based on a maximum likelihood 
algorithm considering more than 500 amino acid positions. Red 
branches = sequences from Ca. Syntrophoarchaeum; SBU = Ca. 

S. butanivorans; SCAL = Ca. S. caldarius, the identifiers refer to the 
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locus.tag of the sequences in the draft genomes. Blue branches indicate 
Bathyarchaeota telated sequences from Evans et al”’. The scale bar 
indicates the number of amino acid substitutions per site. Bootstrap 
values higher than 90% are indicated by filled circles on the corresponding 
branch: 
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Figure 4 | Metabolic scheme proposed for butane oxidation with sulfate 
based on molecular analyses. Ca. Syntrophoarchaeum (red cell) uses 
steps of the methanogenesis pathway (red labels) to activate butane and 

to oxidize methylene-tetrahydromethanopterin to CO), Butyryl-CoA 
oxidation is catalysed by enzymes shared with syntrophic bacteria (blue 
labels). Acetyl-CoA is oxidized by a reverse Wood-Ljungdahl pathway 
(violet labels). Reducing equivalents are transferred via cytochromes 
(produced by both organisms) and pili-based nanowires to the sulfate- 
reducing HotSeep-1 partner bacterium (green cell and labels). Normalized 
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gene expression for Ca. S. butanivorans is indicated as FPKM (Fragments 
Per Kilobase of transcript per Million mapped reads) according to the 
legend-code; enzyme names in boldface indicate their detection in protein 
extracts; dotted arrows mark hypothetical pathways without detected 
genes; *mtd has only been detected in the more complete genome of Ca. 
S. caldarius. Symbol [e’] indicates electrons bound to heme or [FeS]- 
clusters in proteins, or to unknown or non-specified carriers. H2O, energy 
conservation and biosynthesis are not indicated. 
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Figure 5 | Testing metabolic interaction of Ca. Syntrophoarchaeum 
and Ca. D. auxilii in Butane50 cultures. a, Development of hydrogen 
concentration in replicate active Butane50 cultures (n = 2) without 
(circles) and next day continuation with molybdate addition (10 mM 
final concentration; triangles) to inhibit sulfate reduction. Hz production 
was between 3-7 nmol1~' h! and therefore 3-4 orders of magnitude 
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below a potential*hydrogen production of 46,000 nmol 1~! h~! if H, was 
the intermediate (determined from the measured sulfate reduction rate 

of 276 \1mol 1~4d~! before the experiments). b, Transmission electron 
micrographiof amMEPON 812 embedded consortium thin-section. The 
intercellular space in the consortium is filled with abundant nanowire-like 
structures; scale bar = 0.5 1m, representative for >30 recorded images. 
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METHODS 


Origin of inoculum and cultivation of the Butane50 enrichment culture. Gas- 
rich hydrothermally-heated sediments covered with dense mats of Beggiatoa were 
obtained in the Guaymas Basin vent area (27°00.437’ N, 11°24.548’ W; 2000 m 
water depth). Samples were collected by push coring using the submersible Alvin 
(dive 4570) on RV Atlantis during November/December 2009. The sediments 
were stored anaerobically in butyl rubber stopper-sealed glass vials. In the home 
laboratory sediments were 1:4 diluted with anoxic artificial seawater (ASW) 
medium”, initially provided with methane as a substrate, and incubated at 50°C. 
These incubations showed immediate methane-dependent sulfate reduction. After 
3 months a subsample was incubated with butane (0.2 MPa). Initially, we did not 
detect butane-dependent sulfide production, however after 2 months of incuba- 
tion sulfide production set in. When sulfide concentrations exceeded 15 mM the 
culture was diluted (1:5) in fresh ASW medium (semi-continuous cultivation) and 
resupplied with butane. This procedure was repeated several times and resulted in a 
virtually sediment-free culture after 2 years of cultivation. For quantitative growth 
experiments, cultures were set up in 150 ml serum bottles containing 80 ml ASW 
medium, and inoculated with a 20 ml aliquot of a grown culture. Parallel cultures 
with different starting amounts of butane (5 and 7.5 ml butane in the culture head- 
space) were prepared. As controls, we used sterile cultures receiving butane, and 
inoculated cultures lacking butane. All cultures were incubated at 50°C without 
shaking. Measurements of sulfide production and butane were done in triplicates. 
Chemical analyses of sulfide and butane. Sulfide concentrations were determined 
by transferring 0.1 ml culture into 4ml acidified copper sulfate (5 mM) solution. 
The formation of colloidal copper sulfide was determined photometrically at 
480nm°*”. To quantify butane concentration, volumes of 0.1 ml headspace gas were 
withdrawn using N>-flushed, gas-tight syringes. The gas samples were injected 
without a split into a Shimadzu GC-14B gas chromatograph, equipped with a 
Supel-Q PLOT column (30m x 0.53 mm, 30j1m film thickness; Supelco, Bellefonte, 
USA) and a flame ionization detector. The oven temperature was maintained at 
140°C, and the injection and detection temperatures were maintained at 150°C 
and 280°C, respectively. The carrier phase was N) at a flow rate of 3ml min |. 
Samples were analysed in triplicates. Butane concentrations were calculated based 
on an external calibration curve. 

Detection of autofluorescence in Butane50 culture. Consortia from the Butane50 
culture were visualized by Confocal Laser Scanning Microscopy (LSM 780;Zeiss, 
Germany) with an excitation light of 405 nm and an emission filter >463 nm, and 
by recording the maximum autofluorescence at 470 nm wavelength. 
Amplification, sequencing and phylogenetic classification of 16S rRNA and 
functional genes. Total DNA was extracted from 10 ml of the Butane50 culture 
pelleted via centrifugation (4000 rpm for 15 min; Eppendorf Centrifuge 5810R) 
using the FastDNA Spin Kit for Soil (MP Biomedicals) following the manufacturer's 
protocols. Bacterial and archaeal 16S rRNA gene fragments were amplified using 
the primer pairs GM3/GM4* and Arch20F*?/1492R™. Furthermore genes encod- 
ing canonical anaerobic hydrocarbon-activating enzymes including assA/masD 
(primer pairs 7757F-1, 7757F-2/8543R™) and bssA (primer pair 1213F/1987R™) 
were targeted for amplification. For amplification of assA, a mixture of forward 
primers was applied to improve diversity coyerage™. Polymerase chain reactions 
(PCR) were performed in 2011 volumes containing 0.5 1M of each primer solution, 
7.5/6 \.g bovine serum albumin solution;250|1M deoxynucleoside triphosphate 
(dNTP) mixture, 1 x PCR reaction buffer (5Prime, Germany), 0.25U Taq DNA pol- 
ymerase (5Prime) and 1 yl DNA‘template (25-50 ng). PCR reactions (Mastercycler; 
Eppendorf) included an initial denaturation step of 95°C for 5 min followed by 
34 cycles of denaturation (95 °Ofor 1 min.), annealing (1.5 min at 44°C for bacterial 
16S primers, ort 58°C for archaeal 16S primers), and extension (72°C for 3 min) 
and a final 72°C step forJ0 min. For amplification of genes encoding canonical 
hydrocarbon-activating enzymes, the protocol consisted of an initial denatura- 
tion step (95°C for 5min) followed by 34 cycles of denaturation (96°C for 1 min), 
annealing (58°C for assA primers and 55°C for bssA primers, both for 1 min) 
and extension (72°C for 2 min) ending with a final extension (72°C for 10 min). 
All products were checked on 1% agarose gels, stained with ethidium bromide 
and visualized with UV light. Amplicons (archaeal and bacterial 16S rRNA gene) 
were purified (QIAquick PCR Purification Kit; Qiagen) and cloned in Escherichia 
coli (TOPO TA cloning Kit for sequencing; Invitrogen). Clones were screened 
by standard PCR procedure and positive inserts were sequenced using Taq cycle 
sequencing with ABI BigDye Terminator chemistry and an ABI377 sequencer 
(Applied Biosystems, Foster City, CA, USA). Representative full-length sequences 
were used for phylogenetic analysis using the ARB software package*° and the 
SSURef_NR99_115 SILVA database*’. Phylogenetic trees of 16S rRNA genes were 
constructed with RAxML (version 7.7.2) using a 50% similarity filter and the 
GTRGAMMA model. An extended phylogenetic tree is provided as Supplementary 
Figure 3. Branch support values were determined using 100 bootstrap 
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replicates. From the Butane50 culture no masD/assA and bssA genes could be 
amplified. 

Catalysed reported deposition fluorescence in situ hybridization (CARD- 
FISH). Cell aliquots were fixed for 2 hours in 2% formaldehyde, washed and 
stored in phosphate buffered saline (PBS; pH = 7.4): ethanol 1:1. Samples were 
sonicated (30s; Sonoplus HD70; Bandelin) and incubated in 0.1 M HCl (1 min) 
to remove potential carbonate precipitates. Aliquots were filtered on GTTP poly- 
carbonate filters (0.2 1m pore size; Millipore, Darmstadt, Germany). CARD-FISH 
was performed according to Pernthaler et al°* including the following modifica- 
tions: cells were permeabilised with a lysozyme solution (0.5 M EDTA pH 8,0, 1M 
Tris-HCl pH 8.0, 10mg ml“! lysozyme; Sigma-Aldrich) at 37 °C for 30 minutes 
and with a proteinase K solution (0.5 M EDTA, 1 M Tris/HCl, 5M NaCl, 715 1M 
of proteinase K; Merck, Darmstadt, Germany) for 5 min at room temperature; 
endogenous peroxidases were inactivated by incubation in asolution of 0.15% 
H,0) in methanol for 30 min at room temperature. Specific 16S rRNA-targeting 
oligonucleotide probes used were SYNA-407 and HotSeep-1-14567°, both 
applied at 20% formamide concentration. SYNA-407, was developed during 
this project using the probe design tool within the ARB software package to 
specifically detect Ca. Syntrophoarchaeum. The probe is highly specific for Ca. 
Syntrophoarchaeum and has at least one mismatch to non-target group sequences 
in the current database. The stringency of probe SYNA-407 was experimentally 
tested on the Butane50 culture using 10% to 40% formamide in the hybridiza- 
tion buffer. The sequence of the probe is’'5’ AGTCGACACAGGTGCCGA 3°. 
Three helpers were necessary: hSYNA-388 (5’ ACTCGGAGTCCCCTTATC 3’), 
hSYNA-369 (5° CACTTGCGTGCATTGTAA 3’) and hSYNA-426 (5° TATCC 
GGACAGTCGACAC 3’). Probeswere purchased from Biomers (Ulm, Germany). 
In case of doubleshhybridization, the peroxidases from the first hybridization were 
inactivated by incubating the filters in 0.30% HO, in methanol for 30 min at room 
temperature. As fluorochromes Alexa Fluor 488 and Alexa Fluor 594 were used. 
Finally, the,filters were stained with DAPI (4,6’-diamino-2-phenylindole) and 
analysed by epifluorescence microscopy (Axiophot II Imaging, Zeiss, Germany). 
Selected filters were analysed by Confocal Laser Scanning Microscopy (LSM 780, 
Zeiss, Germany). 

Extraction of genomic DNA, library construction and sequencing. Genomic 
DNA was extracted from 15 ml of the Butane50 culture using the FastDNA Spin 
Kit for Soil (MP Biomedicals, Hlkirch, France). For paired-end library prepara- 
tion the TruSeq DNA PCR-Free Sample Prep Kit (Illumina) was used including 
the following modifications of the manufacturer’s guidelines: A total amount of 
700ng DNA (in 50,11 volume) was fragmented in 50011 nebulization buffer (50% 
glycerol v/v, 35 mM Tris-HCl, 5mM EDTA), using a Nebulizer (Roche), with a 
fragmentation time of 3 min, and applied pressure of 32 psi. The fragmented DNA 
was purified via a MinElute purification column (Qiagen). Following end repair, 
the first size selection step (removal of large DNA fragments) was done with a 
sample purification bead/H2O mixture of 6/5 (v/v). 

For mate-pair library construction, genomic DNA was extracted from 35 ml 
Butane50 culture following the protocol after Zhou et al”? with the following mod- 
ifications: cells were collected by centrifugation of the culture aliquot (3000 x g for 
5 min). The pellet was resuspended in 450 1l of extraction buffer, homogenized 
in a tissue grinder and the mixture was freeze-thawed three times. Subsequently 
13501 of fresh extraction buffer and 6011 of Proteinase K were added. In total, 
1370 ng of DNA were obtained and used for mate-pair library construction with the 
Illumina Nextera Mate Pair Sample Preparation Kit following the manufacturer's 
guidelines with the following modifications: a total amount of 1.3 jug DNA was 
used and the fragmentation time was reduced to 15 min. Fragments of lengths 
between 4kb and 9kb were obtained on an agarose gel which were then used 
for further library preparation. Sequencing of both libraries was performed on a 
MiSeq 2500 instrument (Illumina; 2 x 300 cycles) using v3 sequencing chemistry. 
In total 4460548 and 21182518 reads were obtained for the paired-end and mate- 
pair library respectively. 

Read processing, bin assembly and data analysis. The paired-end Illumina reads 
were quality-trimmed after adaptor and contaminant removal using the bbduk 
tool in BBMap (V34: http://sourceforge.net/projects/bbmap; minimum quality 
value of 20; minimum read length > 50 bp). Overlapping paired-end reads were 
merged using bbmerge when overlap exceeded 20 bases without mismatches for 
reads > 150 bp. The 16S rRNA based phylogenetic composition of the paired-end 
library was estimated using the software phyloFlash (https://github.com/HRGV/ 
phyloFlash), which classifies reads taxonomically by mapping reads against the 
SSU SILVA 119 database using bbmap. For quantification, only unambiguously 
mapped reads were counted. For the mate-pair library, junctions, contaminants and 
external adaptors were removed using bbduk. Afterwards, the reads were quality 
trimmed (quality value > 20 and minimum sequence length 50 bp). Bulk assem- 
bly of processed libraries was done with SPAdes (version 3.5.0%) including the 
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BayesHammer error correction step and using default k-mer size recommended 
for the read length (21, 33, 55, 77, 99, 127). The resulting scaffolds were analysed 
and binned using the Metawatt software (version 2.1°'), which analyses the GC 
content, coverage, open reading frames (ORF) and tetranucleotide pattern for each 
scaffold. The subsequent binning of the scaffolds was based on three different 
criteria: highly similar tetranucleotide frequency (98% confidence level), coherent 
taxonomic classification according to BlastP search of the translated ORFs and 
similar GC content and read coverage in the metagenome. Using the software 
RNAmmer™, the 16S rRNAs present in the bulk assembly were extracted to clas- 
sify the different bins of the bulk assembly phylogenetically. Bins corresponding 
to the GoM-Arch87 group were selected and refined. The refinement started with 
a mapping of the raw reads (from complete libraries) to the selected bins (with a 
minimum identity of 90% the first time and 97% the next ones) using the bbmap 
tool from the BBMap package. The mapped reads were reassembled using SPAdes 
(same settings as for the bulk assembly), followed by binning in Metawatt. Contigs 
smaller than 1 kb were removed from the bin. The mate-pair read mapping infor- 
mation of the bin was used to create connectivity graphs using Cytoscape®* and 
to remove poorly-connected contigs. After bin refinement, its completeness was 
checked using AMPHORA2%, which screens for 104 archaeal single copy genes; 
CheckM®, which analyses completeness and contamination based on lineage- 
specific marker sets, in our case Euryarchaeota and tRNAscan’, which screens 
for the different tRNA sequences. The final bins were used as draft genome of Ca. 
S. butanivorans and Ca. S. caldarius for automated gene annotation in RAST 
and genDB® after gene prediction using Glimmer3.02”. After selecting the best 
annotation for each ORF using the automated annotation tool MicHanThi”!, the 
GenDB results were visualized using the JCoast frontend”. All presented genes 
were manually curated afterwards. 

Analysis of species identity and bacterial electron transfer mechanisms. 
A HotSeep-1 bin was retrieved and annotated as described above for Ca. 
Syntrophoarchaeum. To compare our HotSeep-1 bin and the published draft 
genome of Ca. D. auxilii (CP013015), JSpecies1.2.17° was used, which analyses 
the average nucleotide identity and the tetranucleotide frequency between two 
genomes. This method was also used to compare the two genome bins of Ca. 
Syntrophoarchaeum. Furthermore, the two HotSeep-1 strains were compared 
by checking the identity of the following genes: 16S rRNA, 23S rRNA, sulfate 
adenylyltransferase (sat), adenylylsulfate reductase subunit alpha (apr alpha), 
adenylylsulfate reductase subunit beta (apr beta) and dissimilatory sulfite reduc- 
tase subunit alpha (dsr alpha) and of the internal transcribed spacer (ITS) region. 
To study genes encoding pili and cytochromes of HotSeep-1, genes ofinterest were 
identified. This selection was manually curated using Blastp and Pfam search. The 
subcellular localization of cytochromes was predicted usingPSORTb (v.3,0.2”*). 
Search for canonical alkyl succinate synthase genes. To search for canonical genes 
of hydrocarbon oxidation in the metagenome and the bins of Ca. S.butanivorans 
and Ca. S. caldarius, a protein database of anaerobic hydrocarbon oxidation genes 
was constructed. Full-length sequences from hydrocarbon degrading enzymes 
present in the Uniprot database were combined with recently published masD 
sequences’°. These enzymes were AssA, BssAjMasD, the alpha subunit from 
naphtylmethylsuccinate synthase (Nms), the alpha subunit from a ring cleaving 
hydrolase (BamA), and pyruvate formate lyase (Pfl). The bulk assembly and the 
Ca. Syntrophoarchaeum draft genomes were searched against this database using 
Blastx with an E-value of 10°* 

Treatment of Butane50 culture with bromoethanesulfonate. Triplicate Butane50 
cultures and duplicates of CayD. auxilii cultures were grown on their respective 
substrates (butane onhydrogen). Two active Butane50 cultures were incubated with 
bromoethanesulfonate(BES, 5mM final concentration) and as growth control, 
one culture remained untreated. To check the effect of BES on the bacterial part- 
ner alone, hydrogenotrophic grown Ca. D. auxilii cultures were also treated with 
5mM of BES. Sulfate-reducing activity was determined by sulfide measurements 
as described above. 

Phylogenetic analysis of methyl-CoM reductases in Ca. Syntrophoarchaeum 
draft genomes. The McrA amino acid sequences in the genomes of Ca. 
S. butanivorans and Ca. S. caldarius were extracted from the genomic data, and 
used for a phylogenetic reconstruction. 124 reference McrA protein sequences 
longer than 450 amino acids from public databases were aligned with Muscle3.7”°, 
accession numbers of these sequences are provided in the Supplementary Table 
4. After manual refinement of the alignment a masking filter accounting the 
alignment ambiguity of each column was designed using the ZORRO software. 
Phylogenetic trees were calculated using Maximum likelihood algorithm RAxML 
(version 8.2.6’”) with the masking filter and the PROTGAMMA model with LG 
as amino acid substitution model and empirical base frequencies. These were the 
best-fitting conditions according to RAxML using both Akaian and Bayesian infor- 
mation criterion. To find the optimal tree topology 149 bootstraps were calculated 


according to the bootstrap convergence criterion of RAxML. To verify results of the 
presented phylogenetic affiliation, the phylogenetic analyses were repeated using 
1Q-TREE” with LG-+I+F+C20 as substitution model on the same alignment 
(Supplementary Figure 1a). 

To avoid the possibility of long branch attraction, further partial McrA 
sequences of Bathyarchaeota (Supplementary Table 4) were included and only the 
McrA sequence regions common between the partial McrAs of Bathyarchaeota 
and our previous set of full-length sequences (>300 residues) was considered for 
phylogenetic analysis. First, it was confirmed that using these regions for phyloge- 
netic analysis resulted in similar tree topology as using the full-length sequences 
by calculating a phylogenetic tree using RAxML (PROTGAMMALG+1+F) with 
the respective parts of all full-length sequences (the data set used in thejprevious 
phylogenetic analysis; Supplementary Figure 1b). Then the partial sequences of 
Bathyarchaeota were included into the set to perform phylogenetic analysis of the 
common McrA sequence parts using both RAxML (PROTGAMMALG-+I+F, 
Supplementary Figure 1c) and IQ-tree (LG+I+F-+C20,Supplementary Figure 1d). 
Finally, to check if the overall tree topology was influenced by the deeply-branching 
SCAL_000352 sequence, a tree using RAxML (PROTGAMMALG+1+F) with 
only full-length sequences but excluding the SGAL_000352 sequence was con- 
structed (Supplementary Figure le). Allresulting trees were plotted using the iTol 
webserver”? 

Verification of different McrA operons. To test the correct genome assembly and 
to confirm the presence of four mcrA genes per bin, a mcrA clone library was con- 
structed. For each of the 8 mcrA genes found in the two Ca. Syntrophoarchaeum 
bins primer sets were developed, which were used for PCR amplification from 
Butane50 culture DNA (Supplementary Table 5). PCR reactions (2011 volume) 
were performed containing 1 uM primer each, 200|1M dNTPs, 1 x PCR buffer, 
and 0.5 U DNA polymerase\(TaKaRa Taq, TaKaRa Bio Europe, France) under the 
following conditions: initial denaturation at 95°C for 5 min, followed by 39 cycles 
of denaturation (96°C; min), annealing for 1 min, elongation (72°C, 2 min), and 
a final elongation step (72°C, 10 min). For two primer sets, amplification was 
done with Phusion High-Fidelity DNA Polymerase (Thermo Fischer Scientific, 
Germany) using 50,11 reactions containing 1.5mM MgCh, 3% (v/v) DMSO, 0.4.M 
primer each, 50,1M dNTPs, 1 x PCR buffer, and 1 U DNA polymerase under the 
following conditions: initial denaturation at 98°C for 30s, followed by 39 cycles 
of,denaturation (98 °C, 10s), annealing for 30s, elongation (72°C, 50s), and a 
final elongation step (72°C, 10 min). For annealing temperatures for the individual 
primer sets see Supplementary Table 5. PCR resulted in multiple bands, therefore 
amplicons of expected size were excised from an 1% agarose gel and purified using 
the MinElute Gel extraction kit (Qiagen, Germany). DNA was ligated in a pGEM 
T-Easy vector (Promega, Madison, WI) and transformed into E. coli TOP10 cells 
(Invitrogen, Carlsbad, CA) according to the manufacturers’ recommendations. 
Sequencing was performed by Taq cycle sequencing using a vector-specific primer 
(M13F or M13R) with a model ABI377 sequencer (Applied Biosystems). Sequence 
data were analysed with the ARB software package”°. 

Extraction of RNA, library construction and data analysis. Total RNA was 
extracted from 100 ml of an active Butane50 culture, which was kept at 50°C during 
the whole procedure: first most medium (>90%) was replaced by butane gas, 
whereas the biomass remained at the bottom of the bottle. Then RNA was pre- 
served by adding 90 ml preheated RNAlater (Sigma-Aldrich; 10:1 RNAlater vs. 
sample) for 1 hour. Subsequently this mixture was filtered through an RNA-free 
cellulose nitrate filter (pore size 0.45 j1m; Sartorius; Gottingen, Germany). The 
filter was extracted in an RNase-free tube with glass beads and 600 1l of RNA 
Lysis Buffer (Quick-RNA MiniPrep, Zymoresearch, USA) applying bead beat- 
ing (2 cycles of 6 m/s for 20s). The lysate was cleared by centrifugation (10000g; 
1 min) and the supernatant was used for RNA extraction with the Quick-RNA 
MiniPrep Kit (Zymoresearch, Irvine, CA, USA) according to the manufacturer's 
guidelines but omitting the on-column DNase treatment step. The RNA extract 
was cleaned from DNA by incubating it at 37°C for 40 min with 1011 of DNase I 
(DNase I recombinant, RNA-free; Roche Diagnostics, Mannheim, Germany), 7 il 
of 10 x incubation buffer (Roche) and 211 of RNase-Inhibitor (Protector RNase 
Inhibitor, Roche Diagnostics, Mannheim, Germany). DNases were inactivated by 
heating for 10 min to 56°C. Subsequently the RNA was purified with the RNeasy 
MinElute Cleanup Kit (QIAGEN, Hilden, Germany). In total 450 ng high quality 
RNA were obtained. 

The TruSeq Stranded Total RNA Kit (Illumina) was used for RNA library 
preparation. The rRNA depletion step was omitted. 80 ng of the total RNA (in 5,1 
volume) was combined with 13 11 of ‘Fragment, Prime and Finish mix, for 
the RNA fragmentation step according to the Illumina TruSeq stranded mRNA 
sample preparation guide. Subsequent steps were performed as described in the 
sample preparation guide. The library was sequenced on a MiSeq instrument; with 
v3 sequencing chemistry in 2 x 75 cycles paired-end runs. The resulting reads 
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were pre-processed including removal of adaptors and contaminants and quality 
trimming to Q10 using bbduk v34 from the BBMAP package. Trimmed reads 
were used to quantify the 16S rRNA gene based phylogenetic composition of the 
library by phyloFlash as described above for the DNA paired-end library. Trimmed 
reads were also mapped to the bins of interest (Ca. S. butanivorans, HotSeep-1) 
using bbmap with a minimum identity of 97%. The expression level of each gene 
was quantified by counting the number of unambiguously mapped reads per gene 
using featureCount® with the —p option to count fragments instead of reads. To 
compare expression levels between genes, absolute fragment counts per genes were 
converted into Fragments Per Kilobase of transcript per Million mapped reads 
(FPKM®!) as follows: 


C; 


x 10° 


FPKM;= 


where i= any specific gene, j= sum of all the transcribed genes, C= counts and 
L=length (bp). 
Protein analysis by nanoLC-MS/MS. For total protein analysis, the cells from 
50 ml of grown (ca. 10mM sulfide) Butane50 enrichment culture were harvested 
by centrifugation, frozen in liquid nitrogen and stored at —20°C until analysis. The 
cell pellets were suspended in 30 11 of 50 mM ammonium bicarbonate buffer, and 
lysed by three 60s freeze-thaw cycles between liquid nitrogen and +40°C (thermal 
shaker, 1400 rpm). The cell lysate was incubated with 50 mM dithiothreitol at 30°C 
for 1h, followed by alkylation with 200 mM iodacetamide for 1 h at room tempera- 
ture, in the dark, and trypsin digestion (0.6 1g trypsin, Promega) overnight at 37°C. 
Peptides were desalted using C18 Zip Tip columns (Millipore), and analysed by 
nLC-MS/MS using an LTQ-Orbitrap mass spectrometer (Thermo Fisher Scientific) 
equipped with a nanoUPLC system (nanoAquity, Waters) as described before*”. 
Peptide identification was conducted by Proteome Discoverer (v1.4.1.14, 
Thermo Fisher Scientific) using the Mascot search engine with the annotated 
metagenome of Ca. Syntrophoarchaeum as database®. Peptides were considered 
to be identified by Mascot when a probability of 0.05 (probability-based ion score 
threshold of 40) was achieved. emPAI values calculated by Mascot for identified 
proteins were used as semi-quantitative measure to estimate the abundance of 
proteins in the analysed sample*’. The mass spectrometry proteomics data have 
been deposited to the ProteomeXchange Consortium™ via the PRIDE partner 
repository®. 
Synthesis of authentic standards. To synthesize 1-butyl-CoM and 2-butyl-CoM, 
5g of coenzyme M (Na 2-mercaptoethanesulfonate, purity 98%; Sigma Aldrich) 
were dissolved in 40 ml of a 30% (v/v) ammonium hydroxide/solution, in serum 
vials. Twice the molar amount of 1-bromobutane (purity 99%3Sigma Aldrich) or 
2-bromobutane (purity 98%; Sigma Aldrich) were added, the serum bottles were 
closed with butyl rubber septa and incubated at room temperature with vigorous 
shaking (500 rpm) for 4h. The aqueous phase was separated fronythe excess hydro- 
phobic 1- or 2-bromobutane via separatory funnels. Residual, dissolved 1- or 2-bro- 
mobutane was removed by bubbling with nitrogen. Thesolutions were analysed for 
the presence of 1-butyl-CoM or 2-butyl-CoMpby FT-ICR-MS analysis without fur- 
ther purification. Both solutions contained a majorm/z peak at 197.0311; no m/z 
peaks indicative of free CoM, CoM dimers, 1- or 2-bromobutane were detected. 
Both standards were stable andmo interconversion of isomers was observed. 
Metabolite extraction. For preparation of cell extracts, volumes of 20 ml were col- 
lected from grown Butane50 cultures (sulfide concentrations of 14-15mM) under 
anoxic conditions. The cells were harvested by centrifugation (10 min, 10,000 rpm, 
4°C), washed twice with a 100 mM ammonium bicarbonate solution, and finally 
suspended in Lml of.acetonitrile: methanol: water solution (40:40:20 v/v). Glass 
beads (0.1 mm diameter, Roth) were added (0.3 g per tube), and the cells were lysed 
with a PowerLyzer 24 bench top bead-based homogenizer (MO BIO Laboratories, 
CarlsbadsCA) using’5 cycles of 2000 rpm for 50sec, with a 15 sec pause between 
cycles. Prior use, the glass beads were treated with 1N HCl solution and washed 
twice with deionized water. Glass beads and cell debris were removed by centrifu- 
gation, and:the aqueous cell extracts were stored in glass vials at 4°C until analysis. 
Mass spectrometry of cell extracts and standards. Authentic standards and cell 
extract samples were measured with ultra-high resolution mass spectrometry 
(SolariX XR 12T Fourier transform ion cyclotron resonance mass spectrome- 
ter, Bruker Daltonics Inc., Billerica, MA) with negative electrospray ionization 
(capillary voltage: 4.5 kV) in direct infusion mode (4 11/min and 0.1s accumula- 
tion time). Spectra were recorded with a 2 MWord time domain (0.42s transient 
length) between m/z 74 and 3000 resulting in a mass resolution of ca. 250,000 at 
m/z 200. Instrument mass accuracy was linearly calibrated with low-molecular 
mass fatty acids (C4 - C12) between 88 and 199 Da, resulting in an average root- 
mean square error of the calibration masses of 39 ppb (n= 7). For each meas- 
urement, 64 (Butane50 samples), or 128 (controls) spectra were co-added (lock 
mass: 143.10775 m/z) and internally recalibrated with naturally present fatty acids. 
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Collision induced fragmentation of m/z 197 was carried out after quadrupole 
isolation (10 Da window) with 12 V collision energy and 128 scans per measurement 
(lock mass: 199.17035 m/z). The 1-butyl-CoM and 2-butyl-CoM standards were 
diluted to ca. 10j1g/ml and checked for appropriate collision energy and fragment 
pattern. Fragment masses 89.0430 (C4yHoS~ ) and 80.9652 (HSO3_) were then used 
as indicative fragment for butyl-CoM in the cell extracts. The formation of an 
even-electron fragment HSO3 from bisulfite is favoured when a 3-H-atom is 
present®®, However, SO3 * (m/z=79.9674) was also produced upon fragmentation 
of the standards. 

Fragmentation information of the butyl-CoM standards was used to implement 
a UPLC-MS/MS method to validate the isomeric form of m/z 197:031 in the sam- 
ples. A triple quadrupole mass spectrometer (Xevo TQ-S, Waters Cooperation, 
Manchester, UK) in negative electrospray ionization mode was used in multiple 
reaction monitoring (MRM) mode. Indicative butyl-CoMbtransitions (m/z 197 > 89 
and m/z 197 > 81) were initially optimized (cone voltage and collision energy) by 
direct infusion of standard solutions into the mass spectrometer. The mass spec- 
trometer was coupled to a UPLC (ACQUITY I-Glass, Waters Cooperation Milford, 
MA, USA) equipped with a reversed phasecolumn (HSS T3, 25cm, Waters) and 
run with a binary gradient (1% methanolin Water to 90% methanol) at a flow rate 
of 0.3ml min '. For each analysis, LO{t were injected into the UPLC. Retention 
time, presence of both MRM transitions and relative ion ratios as compared to the 
standards were used as quality criteria. 
Quantification of hydrogen production in experiment. Hydrogen production 
in the Butane50 culture was measured by analysing the headspace of replicate 
incubations which were constantly agitated on a shaking table in a 50°C-incubator. 
The butane-dependent sulfideproduction (and therefore potential hydrogen 
production) was;determined from tracking the sulfide production (as above) for 
4 weeks. Gas phase (1 ml) was sampled with a gas-tight syringe to determine hydro- 
gen concentrations (i) before changing the headspace, (ii) after exchanging the 
headspaceiin 30minute intervals for 6 hours (iii) the next day, before and after 
addition of sodium molybdate solution (10 mM final concentration) to the culture 
to stop\potential hydrogen-dependent sulfate reduction. Gas phase was imme- 
diately injected into a Peak Performer 1 gas chromatograph (Peak Laboratories, 
Palo Alto, CA) equipped with a reducing compound photometer. Development 
of hydrogen concentrations were converted into hydrogen production rates and 
compared with potential hydrogen production rates according to a stoichiometry 
of 4:1 (H2-production vs. sulfate reduction). 
Transmission electron microscopy. A 100ml grown Butane50 culture was con- 
centrated by centrifugation at 2,000 rpm using a Stat Spin Microprep 2 table-top 
centrifuge. Aliquots were placed in aluminium platelets of 150|1m depth containing 
1-hexadecen*”. The platelets were frozen using a Leica EM HPM100 high pressure 
freezer (Leica Mikrosysteme, Wetzlar, Germany). The frozen samples were trans- 
ferred to an Automatic Freeze Substitution Unit (Leica EM AFS2) and substituted 
at —90°C in a solution containing anhydrous acetone, 0.1% tannic acid for 24h 
and in anhydrous acetone, 2% OsOu, 0.5% anhydrous glutaraldehyde (Electron 
Microscopy Sciences, Ft. Washington, USA) for additional 8h. After a further 
incubation over 20h at —20°C samples were warmed up to +4°C and washed with 
anhydrous acetone subsequently. The samples were embedded at room tempera- 
ture in Agar 100 (Epon 812 equivalent) at 60°C over 24h. Thin sections (80 nm) 
were examined using a Philips CM 120 BioTwin transmission electron micro- 
scope (Philips Inc. Eindhoven, The Netherlands). Images were recorded with a 
TemCam F416 CMOS camera (TVIPS, Gauting, Germany), for additional images 
see Supplementary Figure 4. 
Data availability. All sequence data are archived in NCBI database under the 
BioSample number SAMN05004607. Representative full-length 16S rRNA gene 
sequences of the clone library of the Butane50 culture have been submitted to 
NCBI under accession numbers KX812780-KX812802. Draft genomes of the Ca. 
Syntrophoarchaeum organisms can be found under the BioProjectPRJNA318983 
(Ca. S. butanivorans) and PRJNA319143 (Ca. S. caldarius). Metagenomic 
and metatranscriptomic reads have been submitted to the short read 
archiveSRS1505411. The mass spectra of the proteomic dataset have been depos- 
ited to the ProteomeXchange Consortium with the dataset identifier PXD005038. 
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Extended Data Figure 1 | Genetic structure of mcr genes in Ca. Syntrophoarchaeum. In Ca. S. butanivorans one mcr gene set is separated, with the 
mcrA.subunit in scaffold 1, and mcrB and merG in scaffold 4. 
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Extended Data Figure 2 | Experiments validating production of alkyl- methane (n= 3) and in the Butane50 culture (n= 2). Methyl-CoM was 
CoM compounds in anaerobic cultures. ajScreening for butyl-CoM in only found in the thermophilic AOM culture. These analyses (a,b) indicate 
Butane50, in a thermophilic AOM culture supplied with butane (n =2 high substrate specificity of the organisms in the two different cultures. 
with 2 different sampling time points); in BuS5 cultures (n = 3) and in c, The also established propane-degrading culture (n= 1) forms propyl- 
controls. The mass peak of butyl-CoM (m/z= 197.0312) was only found CoM (m/z= 183.016; mass accuracy -0.21 ppm) showing CoM-type 
in the Butane50 culture. b, Screening for methyl-CoM (m/z = 154.984; activation of this substrate. 


mass accuracy —0.15 ppm) in the thermophilic AOM culture supplied with 
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Extended Data Figure 3 | Effect of bromoethanesulfonate (BES) on sulfate reduction compared to a control without BES (triangles, n= 1) and 
Butane50 and Ca. D. auxilii cultures. After growth on their specific to b, Ca. D. auxilii cultures (triangles, n= 2) where it had no influence on 
substrates, BES (5mM final concentration) was added to a, Butane50 hydrogen-dependent sulfate reduction. 


cultures (circles, n =2) where it immediately inhibited butane-dependent 
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Extended Data Table 1 | Microbial diversity in the early AOM enrichment?! used as inoculum and in the Butane50 culture 


Early AON Butane50 culture* 
enrichment 
Phylogenetic group Clones Clones Metagenome Metatranscriptome 
reads reads 
Archaea 
Euryarchaeota 
Methanomicrobia 
GoM-Arch87 46 596 32009 
Methermicoccus 2316 
ANME-1 46 1465 
Thermoplasmata 6 
19c-33 cluster 38 199 
pMC2A24 cluster 1 30 
20c-4 cluster 132 
Halobacteria 4 
Sum 56 85 957 35790 
Bacteria 
Proteobacteria 
Deltaproteobacteria 
HotSeep-1 cluster 53 63 575 4780 
Syntrophobacteraceae 1 5 626 
Others 3 3 
Spirochaete 
Kazan-3B-09 1 
TAO06 8 44 
Candidate division OP3 3 6 36 
Candidate division KB1 159 827 
Candidate division WS3 33 
BHI80-139 1 
Chloroflexi 3 
Anaerolineaceae 3 138 
Others 17 
Sum 80 92 985 6233 


*based on 16S rRNA gene sequences retrieved by clone library approach from the Butane50 culture, and found in the metagenome and metatranscriptome libraries using the phyloFlash software. 
Taxa which account for >1% of all 16S rRNA gene sequences are shown. 
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Extended Data Table 2 | Draft genome information and pairwise comparison of whole genome identity of Ca. S. butanivorans and 


the Ca. S. caldarius 


Candidatus Candidatus 
Syntrophoarchaeum Syntrophoarchaeum 
butanivorans caldarius 
Size (base pair) 1,456,963 1,666,081 
Scaffolds/Contigs 16/21 10/15 
Scaffold N50 (bp) 219,218 410,601 
Coverage (times) 360 134 
GC content (%) 48.7 45.5 
Number of ORFs 1,604 1,790 
rRNAs 3 3 
tRNAs 39 44 
Genome Completeness (%) 85'; 88°; 89° 95": 97°; 96° 
Contamination? (%) 0.97 0.32 
Duplication of single copy genes” 2 1 
Strain heterogeneity” (%) 0 0) 
Average Nucleotide Average Nucleotide Tetranucleotide 
Identity (Blast) Identity (MUMmer) frequency 
Ca. S. butanivorans/Ca. S. 73.57 89.57 88.19 
caldarius 
Ca. S. caldarius/Ca. S. 73.56 89.03 88.19 


butanivorans 


1Based on tRNA completeness, using tRNAscan; 


?based on archaea-specific single copy genes, using AMPHORA2; “based on lineage-specific marker genes of Euryarchaeota, using CheckM 
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Extended Data Table 3 | Genes encoding enzymes for butane activation, candidates for further conversion reactions and butyryl-CoA 
oxidation in Ca. S. butanivorans 


Gene Feature Locus tag Transcriptome Proteome 
Absolute 
reads FPKM emPAl 
Butane activation 
mcrA Methyl coenzyme M reductase alpha SBU_000314 8320 491.4 0.15 
mcrA Methyl coenzyme M reductase alpha SBU_000718 1640 108.9 0.05 
mcrA Methyl coenzyme M reductase alpha SBU_001328 514 34.13 
mcrA Methyl coenzyme M reductase alpha SBU_001343 1077 70.5 0.11 
mcrB_ Methyl coenzyme M reductase beta SBU_000719 5047 418.0 0.07 
mcrB Methyl coenzyme M reductase beta SBU_001010 2628 210.1 0.46 
mcrB Methyl coenzyme M reductase beta SBU_001329 311 25.7 
mcrB Methyl coenzyme M reductase beta SBU_001341 602 49.0 0.3 
mcrG Methyl coenzyme M reductase gamma SBU_000717 689 97.7 
mcrG Methyl coenzyme M reductase gamma SBU_001009 1787 243.0 0.5 
mcrG Methyl coenzyme M reductase gamma SBU_001327 642 91.4 
mcrG Methyl coenzyme M reductase gamma SBU_001342 366 52.3 0.37 
hdrA CoB--CoM heterodisulphide reductase, SBU_000296 2586 94.9 0.12 
hdrB CoB--CoM heterodisulphide reductase, SBU_000294 222 27.4 
hdrC CoB--CoM heterodisulphide reductase, SBU_000295 127 36.3 0.35 
hdrA Heterodisulfide reductase subunit A SBU_001347 459 16.9 
hdrA Heterodisulfide reductase subunit A SBU_001502 903 60.7 0.05 
hdrA Heterodisulfide reductase subunit A SBU_001503 995 56.9 0.14 
Candidates for conversion to butyryl-CoA 
mtaA  Methyltransferase corrinoid activation protein SBU_000376 1503 83.6 0.36 
mtaA —_Methylcobamide:CoM methyltransferase SBU_000378 583 61.7 0.09 
mtaA —_ Methylcobamide:CoM methyltransferase SBU~000450 988 101.7 0.61 
mtaA —_Methylcobamide:CoM methyltransferase SBU_001175 274 28.4 
mtaA —_Methylcobamide:CoM methyltransferase SBU_001379 133 13.6 
mtaA —_ Methylcobamide:CoM methyltransferase SBU_001480 553 57.7 0.08 
mtaC — Corrinoid protein SBU_000377 1016 156.7 1.33 
mtaC — Corrinoid methyltransferase SBU_001174 95 16.0 
Butyryl-CoA oxidation 
Acyl-CoA dehydrogenase domain-containing SBU_000172 1751 168.5 1.43 
Acyl-CoA dehydrogenase SBU_000399 2844 205.1 0.55 
Acyl-CoA dehydrogenase domain-containing SBU_000724 449 42.9 0.25 
Acyl-CoA dehydrogenase SBU_001146 177 12.4 0.11 
crt Crotonase SBU_000400 1745 236.4 1.65 
3-Hydroxyacyl-CoA dehydrogenase SBU_000288 2425 312.5 0.81 
3-Hydroxyacyl-CoA dehydrogenase SBU_000843 1151 162.5 0.12 
Acetyl-CoA acetyltransferase SBU_000329 103 9.8 
Acetyl-CoA acetyltransferase SBU_000402 3198 295.4 0.34 
Acetyl-CoA acetyltransferase SBU_000404 1644 152.6 0.16 
Acetyl-CoA acetyltransferase SBU_001291 161 15.5 
etfA Electron transfer flavoprotein subunit alpha SBU_000173 557 63.6 0.1 
etfB Electron transfer flavoprotein subunit beta SBU_000174 506 69.8 0.58 
Fe-S Oxidoreductase SBU_000175 402 38.8 0.55 


Expression as absolute read counts and as fragments per kilobase of transcript per million mapped reads (FPKM) is shown, as well as the corresponding protein abundance as emPAI index. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 
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Gene Feature Locus_tag Transcriptome Proteome 
Absolute 
reads FPKM _ emPAIl 
cdhA  Acetyl-CoA decarbonylase/synthase complex alpha SBU_000891 3756 172.8 0.65 
cdhA  Acetyl-CoA decarbonylase/synthase complex alpha SBU_001568 1518 70.5 0.05 
cdhB _ Acetyl-CoA decarbonylase/synthase complex epsilon SBU_000890 557 120.8 0.64 
cdhB _ Acetyl-CoA decarbonylase/synthase complex epsilon SBU_001569 325 65.1 0.35 
cdhC  Acetyl-CoA decarbonylase/synthase complex beta SBU_000889 4798 350.2 0.96 
cdhC _ Acetyl-CoA decarbonylase/synthase complex beta SBU_001570 2738 212.9 0.09 
cdhD  Acetyl-CoA decarbonylase/synthase complex delta SBU_000887 3183 232.8 3.03 
cdhD  Acetyl-CoA decarbonylase/synthase complex delta SBU_001572 940 79.6 1.41 
cdhE  Acetyl-CoA decarbonylase/synthase complex gamma SBU_000886 3419 266.4 0.52 
cdhE  Acetyl-CoA decarbonylase/synthase complex gamma SBU_001573 1503 118.6 1.24 
metV _5,10-Methylenetetrahydrofolate reductase, C-terminal SBU_000428 220 36:2 
metF __5,10-Methylenetetrahydrofolate reductase SBU_000429 487 59:8 0.01 
hdrA Heterodisulfide reductase subunit A SBU_000430 1435 80.5 
hdrB Heterodisulfide reductase SBU_000431 703 45.3 
hdrC Heterodisulfide reductase SBU_000432 1259 8115 
mvhD _ Methyl-viologen-reducing hydrogenase delta subunit SBU_000433 356 53.9 
hdrA Heterodisulfide reductase subunit A SBU_000434 839 47.2 
mvhD _ Methyl-viologen-reducing hydrogenase delta subunit SBU_000435 585 154 
metV _5,10-Methylenetetrahydrofolate reductase, C-terminal SBU_001330 263 39.4 0.25 
metF __5,10-Methylenetetrahydrofolate reductase SBU_001331 711 83.9 0.20 
hdr Heterodisulfide reductase SBU _.001332 392 16.5 
mvhD _ Methyl-viologen-reducing hydrogenase delta subunit SBU_001333 170 57.4 
mch N(5)N(10)-MethenylH4MPT cyclohydrolase SBU_000838 197 22.6 0.10 
ftr FormyIMF-H4MPT formyltransferase SBU_.001141 416 51.4 0.10 
fwdA Formylmethanofuran dehydrogenase subunit A SBU_000443 1215 79.2 0.11 
fwdB —_ Formylmethanofuran dehydrogenase subunit B SBU_000444 555 47.3 
fwdB —_ Formylmethanofuran dehydrogenase subunit B SBU_000048 226 21.4 
fwdC — Formylmethanofuran dehydrogenase subunit C SBU_000442 567 81.3 0.12 
fwdD —_- Formylmethanofuran dehydrogenase subunit D SBU_000047 75 20.6 
fwdD —_ Formylmethanofuran dehydrogenase subunit D SBU_000445 151 43.8 
fwdE —_ Formylmethanofuran dehydrogenase;Subunit E SBU_000903 23 3.8 
fwdF Formylmethanofuran dehydrogenase subunit F SBU_001540 91 8.7 


Expression as absolute read counts and as fragments per kilobase.of transcript per million mapped reads (FPKM) is shown, as well as the corresponding protein abundance as emPAl index. 
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Extended Data Table 5 | Genes encoding proteins related to electron cycling and energy transfer in Ca. S. butanivorans 


Gene Feature Locus_tag Transcriptome Proteome 
Absolute 
reads FPKM emPAI 
[Ni-Fe]-Hydrogenase large subunit SBU_000461 5409 343.7 0.28 
[Ni-Fe]-Hydrogenase small subunit SBU_000462 1702 196.6 
Cytochrome c-type protein SBU_000189 324 73.2 
Cytochrome c SBU_000960 1161 125.5 0.17 
Cytochrome C SBU_001187 184 15.3 
Cytochrome c SBU_001594 62 5.0 0.21 
Multiheme cytochrome SBU_000341 196 25.4 
Multiheme cytochrome SBU_000342 148 13.1 
Multiheme cytochrome SBU_000614 646 83.2 
Multiheme cytochrome SBU_000694 4224 659.7 
Multiheme cytochrome SBU_000777 3660 571.6 
Multiheme cytochrome SBU_000778 3189 395.7 
Multiheme cytochrome SBU_001337 1953 389.1 0.61 
hdrA Heterodisulphide reductase, subunit A* SBU_000297 646 27.0 0.40 
hdrB Heterodisulphide reductase, subunit B* SBU_000298 1146 146.1 
hdrC Heterodisulphide reductase, subunit C* SBU_000299 503 91.7 0.31 
fdhB Formate dehydrogenase subunit beta SBU_000300 838 86.2 0.89 
mvhD  Methyl-viologen-reducing hydrogenase delta SBU_000301 205 53.9 0.78 
fqoJ F420H2:quinone oxidoreductase subunit J SBU_000209 3 0.6 
fqoK F420H2:quinone oxidoreductase subunit K |= SBU_000210 12 4.1 
fgoL F420H2:quinone oxidoreductase subunit L SBU_000211 133 9.7 
fqoM F420H2:quinone oxidoreductase subunit MM §SBU_000213 166 10.6 
fqoN F420H2:quinone oxidoreductase subunit N SBU_000214 142 12.7 
fqoA F420H2:quinone oxidoreductase subunit A SBU_000215 85 26.7 0.26 
fgoBCD F420H2:quinone oxidoreductase subunit SBU.__000216 323 15.5 0.04 
fqoH F420H2:quinone oxidoreductase subunit H |= SBU,000217 171 17.3 
fqol F420H2:quinone oxidoreductase subunit | SBU_000218 112 16.9 
fqoF F420H2:quinone oxidoreductase subunit F SBU_000219 94 10.0 
nuoH — NADH::quinone oxidoreductase subunit H SBU_000563 446 47.5 0.18 
nuoD NADH: quinone oxidoreductase subunit D SBU_000564 458 45.7 0.26 
nuoC NADH: quinone oxidoreductase»subunit C SBU_000565 236 56.8 0.69 
nuoB NADH: quinone oxidoreductase subunit B SBU_000566 345 34.2 
nuoA NADH:quinone oxidoreductase subunit A SBU_000567 30 9.4 
nuol NADH: quinone oxidoreductase subunit | SBU_000874 119 34.2 0.23 
nuoJ NADH:quinone oxidoreductase subunit J SBU_000875 67 30.4 
nuoK NADH: quinone oxidoreductase subunit K SBU_000877 69 24.4 
nuoL NADH: quinone oxidoreductase subunit L SBU_000878 950 54.5 
nuoM — NADH:quinone oxidoreductase subunit M SBU_000879 802 57.6 
nuoN NADH: quinone oxidoreductase subunit N SBU_000880 994 74.9 


Expression as absolute read counts andjas fragments per kilobase of transcript per million mapped reads (FPKM) is shown, as well as the corresponding protein abundance as emPAI index. 


*Submitted as CoB-GoM heterodisulfide reductases. 
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Extended Data Table 6 | BLASTP search of proteins involved in butyrate oxidation. Best results according to the E-value are shown 


Gene product Locus tag Blast annotation AessSslon Organism pevelad> E-value 
number identity 
Acyl-CoA 
dehydrogenase ion Desulfatirhabai 
domain-containing SBU_000172  2CY"¥0 WP 028a2a5594 > ouveNepem 100/66 0.0 
: dehydrogenase butyrativorans 
protein 
heen -CoA Desulfatigl 
dehydrogenase SBU_000399  @CY"¥0 WP_028321791.1 Desulfatigians 99/51 6e:175 
= dehydrogenase = anilini 
Acyl-CoA 
dehydrogenase CoA Desulfatiq! 
domain-containing SBU_000724  @CY"¥0 WP_028319806.1  ~CStWfatigians 98/62 8e-175 
: dehydrogenase anilini 
protein 
Acyl-CoA Peptococcaceae 
dehydrogenase SBU_001146 hypothetical protein WP_036734863.1 bacterium 97/58 0.0 
SCADC1_ 293 
Crotonase Dehalococcoidia 
SBU_000400 hypothetical protein WP_029475295.1 bacterium SCGC 96/69 7e-128 
AB=539-J10 
3-hydroxybutyryl- 
3-Hydroxyacyl-CoA CoA Desulfotignum 
dehydrogenase SBU_000288 dehydrogenase EMS78924.1 phosphitoxidans 100/65 8e-134 
Hbd DSM 13687 
3-Hydroxyacyl-CoA na ear Ze eee 
dehydrogenase SBU_000843 ee WPeg7s0797.1  “edonobacter 98/61 5e-98 
dehydrogenase racemifer 
Acetyl-CoA acetyl-CoA Meth i 
acetyltransferase SBU_000329 acetyltransfefa8e. + WP_014407007.1 snot . 99/67 0.0 
conserved 
Acetyl-CoA hypothetical itured 
acetyltransferase SBU_000402 protein, thiolase CBH39006.1 aneurin 100/70 0.0 
; archaeon 
family 
conserved 
Acetyl-CoA hypothetical orea 
acetyltransferase SBU_000404 protein, thiolase CBH39006.1 panei 100/57 6e-159 
family 
Acetyl-CoA acetyl-CoA Th \ 
acetyltransferase SBU_001291 acetyltransferase © WP_010917519.1 ie aa 98/43 4e-103 
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Extended Data Table 7 | Genes encoding Type IV pili and 10 most expressed cytochromes identified in the HotSeep-1 genome bin from the 


Butane50 culture 


Best Blast Hit Pfam hits Transcriptome 
Gene i i 
Feature Accession Coverage/ Domain ID/ Absolute FPKM 
number Identity accession reads 
type IV pilus N 
: |_methyl_3/ 
PilA sot AMM41792.1 100/99 PF13633.3 8 3 
: pilus assembly N_methyl_3/ 
PilA protein PilA AMM39924.1 100/88 PF13633.3 1780 582 
Dis_P_Dis/ PF06750; 
Peptidase A24 peptidase A24 AMM42236.1 100/100 Peptidase_A24/ 31 9 
PF01478 
pilus AMIN/ PF11741; 
; Bas STN/ PFO7660; 
PilQ ee Pia. AMM40410.1 100/99 Secretin_N/ PF03958: 116 12 
P Secretin/PF00263 
supe.ly pilus T2SSC/ PF11356; 
P 56; 
PilP assembly AMM40409.1 100/99 PiIP/PF04351 8 4 
protein PilP 
PilO pilus assembly ayyi4o408.1 100/100 PilO/PF04350 7 2 
protein PilO 
type IV pilus 
PilN assembly AMM40407.1 100/98 PilIN/PF05137 13 5 
protein PilN 
PilM pilus assembly — anaigo4o6.1 —-100/99 Pil_ 2/PF11104 32 6 
protein PilM 
putative pilus 
PilW assembly AMM39834.1 100/99 N_methyl_2/PF13544 53 10 
protein PilW 
type IV pilus 
PilY1 assembly AMM41700.1 100/98 - 247 12 
protein 
putative pilus ? a 
PilY1 assembly AMMd1699.1 100/99 Weisseria: FIC/PFOSS® a1 16 
protein PilY 
type IV pilus 
PilL assembly AMM41701.1 100/93 - 3 2 
protein pilL 
type IV pilus 
PilC assembly AMM42043.1 97/99 T2SSF/PF00482 (x2) 50 9 
protein PilC 
Cytochrome c type Best Blast Hit Cellular licins Transcriptome 
based on PfamA ee Accession Coveragell localization groups Absolute = poy 
domain prediction number dentity (PSORTb) reads 
Cytochrom_Clll Glass ll AMM42051.1 100/97 _—Periplasmic 4 505 238 
cytochrome C 
Paired CXXCH 1  doubled/CXXCH Aninigoase.1 99/46 Extracellular 5 1244 232 
cytochrome.C 
' doubled CXXCH Unknown 
Paired_CXXCH_1 o¥tochromaC AMM40456.1 100/94 (CM,P,OM.E) 6 768 143 
: doubled GCXXCH Unknown 
Paired_CXXCH_1 cytochnéme C AMM39976.1 100/94 (CM,P,OM.E) 7 471 94 
Paired_CXXGH_1 cytochrome C AMM40455.1 96/95 Extracellular 6 483 96 
Cytochrom_Clll ee c AMM41048.1 100/99 Periplasmic 4 69 31 
Paired_CXXCH_1 cytochrome C AMM40346.1 100/99 Cytoplasmic 10 135 28 
Paired sCXXCH_1 cytochrome C AMM40455.1 99/46 Periplasmic 7 105 21 
Cytochrom_c3.2 cytochrome AMM40350.1 100/100 an 12 72 17 
(CM,P,E) 
Cytochrom_c3_2 cytochrome C AMM40349.1 100/99 Periplasmic 12 53 14 


1For unknown cellular localization of cytochromes, potential locations are indicated according to the score value; CM, cytoplasmic membrane; P, periplasmic; OM, outer membrane; E, extracellular. 
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Tidal evolution of the Moon from a high- obliquity, 
high-angular-momentum Earth 


Matija Cuk!, Douglas P. Hamilton2, Simon J. Lock? & Sarah T. Stewart* 


In the giant-impact hypothesis for lunar origin, the Moon accreted 
from an equatorial circum-terrestrial disk; however, the current 
lunar orbital inclination of five degrees requires a subsequent 
dynamical process that is still unclear!~>. In addition, the giant- 
impact theory has been challenged by the Moon’s unexpectedly 
Earth-like isotopic composition**. Here we show that tidal 
dissipation due to lunar obliquity was an important effect during 
the Moon's tidal evolution, and the lunar inclination in the past must 
have been very large, defying theoretical explanations. We present a 
tidal evolution model starting with the Moon in an equatorial orbit 
around an initially fast-spinning, high-obliquity Earth, which is a 
probable outcome of giant impacts. Using numerical modelling, 
we show that the solar perturbations on the Moons orbit naturally 
induce a large lunar inclination and remove angular momentum 
from the Earth-Moon system. Our tidal evolution model supports 
recent high-angular-momentum, giant-impact scenarios to explain 
the Moon's isotopic composition®* and provides a new pathway to 
reach Earths climatically favourable low obliquity. 

The leading theory for lunar origin is the giant impact®!°, which 
explains the Moon's large relative size and small iron core. Here we 
refer to the giant-impact theory in which the Earth-Moon post-impact 
angular momentum (AM) was the same as it is now (in agreement with 
classic lunar tidal evolution studies!!!) as the ‘canonical’ theory. In the 
canonical giant-impact model’, a Mars-mass body obliquely impacts 
the proto-Earth near the escape velocity to generate a circum-terrestrial 
debris disk. The AM of the system is set by the impact, and the Moon 
accretes from the disk, which is predominantly (>60 wt%) composed of 
impactor material. However, Earth and the Moon share nearly identical 
isotope ratios for a wide range of elements, and this isotopic signature 
is distinct from all other extraterrestrial materials*’. Because isotopic 
variations arise from multiple processes*, the Moon must have formed 
from, or equilibrated with, Earth’s mantle®!*, Earth-Moon isotopic 
equilibration in the canonical model has been proposed by ref. 15, but 
has been questioned by other researchers!®, who suggest that the large 
amount of mass exchange required to homogenize isotopes could lead 
to the collapse of the proto-lunar disk. 

Cuk and Stewart® proposed a new variant of the giant-impact 
model that is based on an initially high-AM Earth-Moon system. 
In this model, a late erosive impact onto a fast-spinning proto-Earth 
produced a disk that was massive enough to form the Moon, and was 
composed primarily of material from Earth, potentially satisfying the 
isotopic observations. Canup’ presented a variation of a high-AM 
origin in which a slow collision between two similar-mass bodies 
produces a fast-spinning Earth and a disk with Earth-like composition 
that becomes the Moon. Subsequently, Lock et al.* have argued that 
a range of high-energy, high-AM giant impacts generate a particular 
post-impact state where the Earth’s mantle, atmosphere and disk are 
not dynamically isolated from each other, enabling widespread mixing 
and equilibration between the accreting Moon and Earth. After the 


impact, these high-AM models require a plausible mechanism with 
which to remove AM to be consistent with the current Earth-Moon 
system. 

Cuk and Stewart® originally proposed that the excess AM was lost 
during tidal evolution of the Moon via the evection resonance between 
Earth’s orbital period and precession of the Moon's perigee®. More 
recently, Wisdom and Tian’’ found that the evection near resonance 
can reduce the system’s AM to the present value for a wider range of 
tidal parameters than explored by ref. 6. However, AM loss through 
the evection resonance is still confined to a subset of possible tidal 
parameters for Earth and the Moon, and the high-AM giant-impact 
scenario requires a robust mechanism for reproducing the present-day 
system. In this work, we propose a more plausible model of lunar tidal 
evolution that removes AM, but also may solve another major problem 
for lunar origin: the Moon’s orbital inclination. 

The Moon’ orbit is currently inclined by about 5°, but studies of its 
tidal evolution!” have found that the inclination would have to have 
been at least 12° at formation, if the inclination was primordial. This is 
at odds with lunar formation from a flat disk in Earth's equatorial plane, 
which should produce a Moon with no inclination. Hypotheses that 
have been proposed to explain the lunar inclination include a sequence 
of Moon-Sun resonances!, resonant interaction with the protolunar 
disk’, and encounters with large planetesimals after the formation of 
the Moon’. However, past studies of lunar tidal history®'!'? ignored 
the obliquity tides within the Moon, despite the Moon having very 
large ‘forced’ obliquity when it was between 30 and 40 Earth radii 
(Rg) away from Earth owing to the lunar spin axis undergoing the 
Cassini state transition'®!°. Chen and Nimmo” found that the lunar 
obliquity tides (driven by Earth’s apparent north-south motion 
relative to the lunar figure (body)) greatly decrease the Moon's orbital 
inclination. 

To quantify the effect of obliquity tides, we used a semi-analytical 
tidal model (see Methods subsection ‘Damping of lunar inclination 
by obliquity tides’). Chen and Nimmo”’ considered tides within the 
lunar magma ocean that rely on excitation of Rossby waves”!, but 
here we considered only the tidal response of the current, ‘cold’ Moon. If 
we additionally assume long-term average tidal dissipation within Earth 
and current lunar tidal properties, we find that the orbital inclination of 
the Moon must have been substantially higher before the Cassini state 
transition, possibly as high as 30° (Methods subsection ‘Damping of 
lunar inclination by obliquity tides’ and Extended Data Fig. 1). If we 
were to extrapolate this inclination back to the time of lunar formation 
close to Earth, the Moon must have formed with an orbit inclined at 
over 50° to the equator, which is clearly inconsistent with a giant- 
impact origin and suggests that the inclination was acquired after the 
Moon formed. In addition, the planetesimal encounter hypothesis? has 
difficulty producing both the correct lunar eccentricity and inclination 
simultaneously when lunar inclination damping by lunar obliquity tides 
and lunar eccentricity excitation by Earth tides are taken into account 
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(Methods subsection ‘Excitation of lunar inclination by encounters with 
planetesimals’ and Extended Data Fig. 2). 

We show that the tidal evolution of the Moon starting with a 
high-obliquity, high- AM Earth can reproduce the current lunar orbit, 
including the lunar inclination and the Earth-Moon system AM. For 
any perturbed orbit, there exists a Laplace plane around which the 
orbital plane of the perturbed orbit precesses. For close-in moons of 
oblate planets like Jupiter and Saturn, the Laplace plane is the equatorial 
plane of the planet, whereas for outer irregular satellites of these planets 
that are strongly perturbed by the Sun, the Laplace plane is their 
planet's heliocentric orbital plane. For the Moon, the Laplace plane under- 
goes a transition during lunar tidal evolution when the Moon recedes 
from the inner region dominated by perturbations from Earth’s 
equatorial bulge to the outer region dominated by solar perturbations. 
In the inner region, the Laplace plane is close to Earth’s equator, and 
in the outer region, the Laplace plane is close to the plane of Earth's 
heliocentric orbit (the ecliptic). At the transition between these two 
regimes, the Laplace plane is intermediate between the equator and 
the ecliptic. The distance at which the Laplace plane transition occurs 
is approximately” 


1/5 
Mp7 3 
ry =|2J,—=Rea (1) 
L | *"'Ms | 


where J> is the oblateness moment of Earth, Mg and Msg are the mass 
of Earth and the mass of the Sun, and ag is Earth’s semimajor axis. 
For a planet like Earth that is in hydrostatic equilibrium, J, depends 
on the rotation rate. Therefore, the critical distance for the Laplace 
plane transition has been moving inward over the course of lunar tidal 
evolution as Earth's rotation slows down and Earth becomes more 
spherical. For an Earth-Moon system with 100%-180% of the present 
AM, the Laplace plane transition happens at 16-22Rg. 

For small and moderate obliquities of Earth (that is, angles between 
the Equator and the ecliptic), the Laplace plane transition is smooth 
and does not produce any excitation of lunar eccentricity or inclination. 
However, the Laplace plane transition causes orbital instabilities for 
obliquities above 68.9° (ref. 23). Satellites on circular orbits around 
high-obliquity planets migrating through the Laplace plane transition 
can acquire substantial eccentricities and inclinations. This excitation 
is driven by solar secular perturbations that operate at high inclinations 
(“Kozai resonance”). For high planetary obliquities, satellites close 
to the Laplace plane transition with low ‘free’ inclinations (to the local 
Laplace plane) still experience solar Kozai perturbations, because their 
orbits have large instantaneous inclinations relative to the ecliptic’®. 
A related mechanism produces complex dynamics previously found by 
ref. 26 for the tidal evolution of hypothetical high-obliquity Earth-like 
planets with large moons. Atobe and Ida”® also found that the mass of 
a Moon-sized satellite has a large effect on the dynamics of the system, 
enabling stagnation or even reversal of tidal evolution and large-scale 
AM loss from the system; however, they used an averaged model that 
did not track eccentricity and could not capture the full dynamics of 
the Laplace plane transition. 

To study the tidal evolution of the Moon from a high-obliquity 
Earth followed by inclination damping at the Cassini state transition, 
we wrote a specialized numerical integrator, R-SISTEM, which resolves 
lunar rotation and therefore fully models lunar obliquity tides (Methods 
subsection ‘Numerical methods’). Figure 1 (see also Supplementary 
Video 1) shows the early tidal evolution for two simulations that assume 
that the Earth-Moon system initially had 1.8 times its current AM, 
as proposed by ref. 6, but with an initial obliquity to the ecliptic of 
70°. Solar perturbations induce sizeable lunar eccentricity when the 
Moon reaches the Laplace plane transition at about 17Rg, triggering 
strong eccentricity-damping satellite tides that shrink the semimajor 
axis and approximately balance the outward push of Earth tides 
(see the first 16 Myr of Fig. 1). Because eccentric orbits have less AM 
than circular ones, AM is removed from the lunar orbit and transferred 
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Figure 1 | Numerical simulation of the Moon’s early tidal evolution 
from Earth with initial obliquity of 70° and spin period of 2.5h. 

a, Semimajor axis. c, Eccentricity. d, Inclination to the ecliptic of the lunar 
orbit. e, Earth’s obliquity to the ecliptic and the AM of the system (b) in 


units of ag, [GM3Rz, where Mg, Rp and ag = 0.33 are the mass, radius 

and the scaled moment of inertia of Earth, respectively, and G is the 
gravitational constant. In these units, the present AM is 0.35. The grey lines 
plot a simulation in which tidal properties of Earth and the Moon were 

ko. z/Qr= ko,m/Qu =0.01 throughout (see Methods subsection ‘Damping of 
lunar inclination by obliquity tides’ for definitions). The black line shows a 
simulation branching at 20 million years (Myr) after the giant impact by 
changing ky,z/Q to 0.005. In the AM plot (b), the thin lines plot a scalar sum 
of spin and orbital AM, while the lower (thick) band includes only the 
component of the lunar orbital AM vector that is perpendicular to the ecliptic. 


to Earth’s heliocentric orbit; Earth tides in turn transfer AM from 
Earth's spin to the lunar orbit, whereas satellite tides do not change the 
AM of the Earth-Moon system. During this prolonged stalling of lunar 
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Figure 2 | Lunar inclination and Earth’s obliquity for various 
simulations. The initial and final lunar inclination (a) and Earth’s 
obliquity (b) are plotted against corresponding values of the Earth- 
Moon system AM in different simulations. AM is plotted in units of 

ag,{ GMi.Rg, and all simulations of the Laplace plane transition included 
here started with Earth’s obliquity of 70°. The squares correspond to 
simulations plotted in Fig. 1: the open square plots a simulation with 

Q:/ kop = Qu/k2,m = 100 and the filled square plots the branch for which 
Qr/ko,z = 200 after 20 Myr. Circles plot simulations with Qp/ko,7 = 200 
throughout (shown in Extended Data Fig. 5), with white, grey and black 
circles corresponding to cases with Quy/k2,q = 200, 100 and 50, respectively. 
Simulations with higher initial AM (2.25h initial spin) (Extended Data 
Fig. 6) are plotted with triangles: the open triangle plots a simulation with 
Qe/k2,z = Qu/k2,.m = 100 and the filled triangle plots the branch for which 
Qz/ky.2 = 200 after 30 Myr. All simulations saw a substantial loss of AM, 
excitation of lunar inclination to large values, and a large reduction of 
Earth’s obliquity. We conclude that a smaller Qg leads to a larger AM loss, 
while a greater Q; later in the Laplace plane transition leads to smaller 
final obliquity for Earth. 


tidal evolution, the Moon acquires large inclination (over 30°), while 
the obliquity of Earth decreases. 

In the later part of this complex period of the lunar orbital history, 
lunar eccentricity is excited by secular near-resonances between lunar 
inclination and eccentricity and Earth’s oblateness (see Methods 
subsection ‘Laplace plane transition, and Extended Data Figs 3 and 4). 
Depending on the exact tidal parameters used for Earth and the Moon, 
Earth's obliquity can reach that required to match the present value 
(<20°)?’, while the AM of the system also matches the present 
value (0.35 in units of ag,/GMpRg; see Fig. 1 legend for definitions). 
Figure 2 presents the results of the Laplace plane transition simulations 
using different tidal parameters for Earth and the Moon and a different 
initial AM of Earth (Extended Data Figs 5 and 6). Large AM loss, 
high lunar inclination and low terrestrial obliquity is a frequent 
outcome. 

In Fig. 3, we explore the early part of the Laplace plane transition for 
different initial obliquities of Earth (with the same total AM). Cases 
with initial obliquities of 65° and 75° are similar to Fig. 1, with larger 
obliquity leading to larger AM loss in the early stages of the Laplace 
plane transition. The simulation with an initial obliquity of 80° experi- 
ences stalling and reversal of tidal evolution, with the Moon eventually 
falling back onto Earth. This evolution agrees with prior results”°, and 
we expect this outcome for all obliquities larger than 80°. We also found 
that the Moon evolving from Earth with an initial obliquity of 60° does 
not experience any instability at the Laplace plane transition. As the 
orientation of terrestrial planets’ spin axes is probably determined by 
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Figure 3 | Similar to Fig. 1, but with different initial obliquities for 
Earth. Obliquities are 80° (black line), 75° (red line) and 65° (blue 

line). Qg/ko,z = Qu/k2,m = 100 throughout the simulations. Unlike the 
simulations shown in Fig. 1, these simulations were followed for only 

20 Myr. The 80° simulation leads to the Moon falling back on Earth. 

We note a trend that the AM loss (b) is larger for higher initial obliquities 
of Earth. 


giant impacts, their poles should be randomly distributed on a sphere. 
About a third of all Earth-like planets should have obliquities within 
60°-80° or 100°-120°, and, if they have large moons, would lose AM 
and obliquity (without losing the moon) at the Laplace plane transition. 
Furthermore, Kokubo and Genda”’ found that planets with large AM 
are a frequent model outcome of terrestrial planet formation. 

Once the Moon has passed through the Laplace plane transition, 
it continues to recede from Earth and lunar rotation passes through 
the Cassini state transition). Using R-SISTEM, we find that the Moon 
probably spent some time in a non-synchronous rotation state when 
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Figure 4 | Numerical integration of the later phase of lunar tidal 
evolution, assuming a lunar inclination of 30° at 25R, and the current 
shape of the Moon. The panels plot lunar inclination to the ecliptic (a), 
lunar obliquity with respect to its orbit (b) and lunar spin rate (c) against 
the Moon’s semimajor axis. The red points plot the segments of the tidal 
evolution that were artificially accelerated, while the blue points plot 

the intervals integrated with nominal parameters. We used k7,5 = 0.3, 

Qr = 35, kom = 0.024 and Quo = 60 (for the numerical implementation of 
lunar tides, see Methods subsection ‘Numerical methods’), with the Love 
numbers enhanced by 100 times before the event at 29.7R, and by 10 times 
after that event. Black lines plot the Cassini state obliquity (b) and the 
synchronous rotation rate (c) expected for each instantaneous semimajor 
axis and inclination. The Moon is in non-synchronous rotation from 
29.7RE to 35Rp. 


close to the Cassini state transition (Fig. 4, Extended Data Figs 7 and 
8) and that transitions between the rotation states can be triggered 
by various resonances or impacts (Methods subsection ‘Cassini state 
transition, Extended Data Figs 9 and 10, and Supplementary Video 2). 
Regardless of the nature of the lunar rotation state, the Moon’s obliquity 
is very high during the Cassini state transition and immediately 
following it, leading to damping of lunar inclination. Although the 
implementation of high-obliquity satellite tides in a fully numerical 
integrator is challenging (Methods subsection ‘Numerical methods’), 
we find that the lunar inclination damps from 30° (obtained during the 
Laplace plane transition) to its present value if we assume the long-term 
average tidal properties for Earth and a relatively non-dissipative, solid 
Moon (Fig. 4). 

The rotational dynamics of the Moon is strongly dependent on the 
Moon’ global shape. The early Moon probably had little strength, and 
its shape was in equilibrium with tidal forces”. In Fig. 1, we modelled 
the Moon as a rigid body for numerical tractability, but we periodi- 
cally reset its figure to match an equilibrium shape for that distance 
from Earth, assuming synchronous rotation’. This assumption of a 
hydrostatic-like shape results in low obliquities in Cassini state 1 when 
the Moon is close to Earth’®. Since the current triaxial shape of the 
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Moon matches the order of magnitude of tidal deformation expected 
at 23-26Rgy (ref. 29), we assumed that the Moon is rigid and has the 
present-day principal moments beyond 25Rg. Our orbital history, 
shown in Fig. 1, may be consistent with the proposal that the current 
lunar shape froze at a distance of 15-17R¢ from Earth on an orbit with 
e © 0.2 (ref. 30). 

Our high-obliquity model is at present the only model we are 
aware of to explain the origin of large past lunar inclination, which 
was subsequently reduced by strong obliquity tides at the Cassini state 
transition”’. A high-obliquity early Earth offers an AM loss mechanism 
more robust than the evection resonance®!”. Therefore, our results 
support high-AM giant-impact scenarios for lunar origin®®. An initially 
high-obliquity Earth is consistent with the expectation of random 
spin-axis orientations for terrestrial planets after giant impacts, and the 
dynamics discussed here naturally reduces Earth's obliquity to values 
that are low to moderate. This mechanism also provides a novel way 
in which initially highly tilted terrestrial exoplanets can acquire low 
obliquities and potentially stable climates. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Damping of lunar inclination by obliquity tides. To quantify the effect of 
obliquity tides on the lunar orbit, we constructed a semi-analytical method for 
modelling the evolution of lunar orbit under the influence of Earth and Moon tides. 
This model assumes that the Moon is in synchronous rotation and the relevant 
Cassini state, unlike the fully numerical integrator R-SISTEM that we use elsewhere 
in this work, which fully resolves lunar rotation. The goal of this model is (1) simply 
to demonstrate the importance of lunar obliquity tides during the Cassini state 
transition, and (2) to allow for more efficient integration of the lunar orbit beyond 
40Rg, when the Moon was probably in Cassini state 2, and its obliquity, inclination 
and eccentricity were moderate or small. 

The semi-analytical model is described by a system of equations 
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where a, e, iand n are the semimajor axis, eccentricity, inclination and mean 
motion of the lunar orbit, respectively; f is time; M, R, kp and Q are mass, radius, 
Love number and tidal quality factor, while the subscripts E and M refer to Earth 
and Moon, respectively (subscripts after derivatives indicate whether the effect 
is due to Earth or lunar tides). Lunar obliquity to the orbit, 0, is calculated as the 
solution to the equation!§? 


2 | Sas }sindcose | é »( == sina cos#) + Qsin(@—i)=0 (7) 
3 C 8 C 
where A, B and Care the Moon's principal moments of inertia, and 2 is the rate of 
precession of the lunar node, which we estimate as** 
2 
0 = 2. cosi (8) 
4n 
where ng is Earth’s heliocentric mean motion. This expression for (2 assumes 
precession dominated by solar perturbations and is invalid close to Earth, where 
the influence of its equatorial bulge is important. Our numerical implementation 
is a simple mapping that solves for lunar obliquity, calculates tidal derivatives and 
then advances the orbital elements (we used a 1-Myr timestep, appropriate for 
distances beyond approximately 25Rp). 

Extended Data Figure 1 plots three orbital histories calculated using the above 
approach that result in the Moon moving from 25Rg to 60R¢ over 4.5 billion years, 
and having a final inclination of about 5° (the Moon’s eccentricity was about 0.01 
in these calculations). Extended Data Figure 1 clearly shows that the past studies 
of lunar tidal evolution that neglected lunar tides'"” greatly underestimate the 
past inclination of the Moon. Even assuming a tidal quality factor of Q= 100 
(which is larger than the present value of 38) for the Moon during the Cassini state 
transition, the inclination had to be over 17° at 25Rg, which would in turn require 
an inclination of approximately 30° when the Moon was close to Earth, much 
greater than the often-cited 12° (ref. 12). If we adopt the current tidal parameters 
of the Moon (Q= 38), the inclination at 25Rg was over 30°, seemingly at odds with 
the formation of the Moon in an equatorial disk. 

Although our simplified model uses several approximations, we can argue that 
the values for the past lunar inclination that we calculate are underestimates, for 
at least three reasons. First, equation (5) assumes low obliquities, and at obliquities 
close to 90°, the leading coefficient is 15/8 rather than 3/2 (ref. 19), leading to 
a more rapid damping of inclination. Second, we assumed the current Love 


number for the Moon (kz, = 0.024), which implies that the Moon was as rigid 
at the Cassini state transition as it is now; however, because the Moon was only a 
few hundred million years old and much warmer in the interior at the time of the 
transition, its Love number would be higher than it is now, again leading to more 
rapid inclination damping. Third, we assumed constant tidal properties for Earth 
throughout the calculation, which is in conflict with the fact that the current rate of 
the Moon's tidal recession is about three times higher than the long-term average’. 
An increase in the tidal dissipation within Earth’s oceans over time, as suggested 
by modelling***” would mean that the Moon spent more time at the Cassini state 
transition than we calculated, allowing more damping of inclination. These factors 
are all independent of the fact that the Moon may not have been in synchronous 
rotation during the Cassini state transition, as discussed in this work. 
Excitation of inclination by planetesimal fly-bys. Recently, Pahlevan and 
Morbidelli? proposed that the lunar inclination was produced by encounters 
between the Earth-Moon system and leftover planetesimals. While innovative, 
this model has several outstanding issues which caused the authors substan- 
tially to overestimate the effectiveness of planetesimal encounters in raising the 
lunar inclination. They? do not include lunar obliquity tides explicitly in their 
model, but assume that very high tidal dissipation rates within Earth moved 
the Moon to 40R¢ within a few tens of millions of years, allowing most of the 
planetesimal encounters to occur after the Cassini state transition, keeping the 
newly acquired inclination safe from further damping. This timeline works if 
ky x/Qg=0.1 for the early Earth (which we find surprisingly high for a planetary 
body), but not for Earth’s long-term average k>,:/Q: = 0.01, which would put the 
Moon at the Cassini state transition during the epoch of planetesimal encounters. 
However, given the uncertainties in the tidal properties of early Earth, here we 
will concentrate instead on the orbital mechanics aspects of the Pahlevan and 
Morbidelli? model. 

Extended Data Figure 2 shows the tidal evolution of the Moon using our 
semi-analytical model (Methods subsection ‘Damping of lunar inclination by 
obliquity tides’), starting with the end-state of the simulation featured in figure 1 
of ref. 3 (a=47Rg, i=5.8°). Eccentricity was not specified in ref. 3, but it was stated 
that the encounter simulations typically result in e = 2sini, so we have used e=0.2. 
We find that there is, starting from these initial conditions, no combination of tidal 
parameters for Earth and the Moon that can result in the correct eccentricity and 
inclination for the Moon at the present epoch. If we assume a larger excitation of 
lunar inclination, the correspondingly larger lunar eccentricity (given the e=2sini 
condition) counteracts Earth tides and slow down lunar recession. Stronger Earth 
tides lead to faster outward evolution but also produce a net increase of lunar 
eccentricity (owing to equation (2)), potentially leading to a reversal of the Moon's 
orbital evolution through lunar eccentricity tides. A slowdown or reversal in lunar 
tidal recession would preclude the Moon from reaching its present distance in 
4.5 billion years. Therefore we conclude that for planetesimal encounters to be able 
to explain the lunar inclination, the encounters must excite inclination without 
substantially exciting eccentricity (that is, leave the Moon with a lower eccentricity 
than that reported by ref. 3). Given the stochastic nature of the process, a small 
number of encounter outcomes will have a low eccentricity and high inclination. 
However, while possible, such scenarios are statistically unlikely and therefore 
not compelling. 

Mechanisms other than tides have the potential to alter lunar eccentricity. It 
has been suggested that the current lunar eccentricity of 0.055 is the product of 
a resonance between the Moon and Jupiter that happens when the Moon is at 
about 53Rg (ref. 38). This resonance arises because the rate of precession of the 
lunar longitude of pericentre is commensurate with the mean motion of Jupiter 
(Jovian evectiom). The rate of precession is (ignoring lunar inclination, which is 
not affected by the resonance)**: 
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where eg is the eccentricity of Earth’s orbit. Because eg varies rapidly on 10°-year 
timescales owing to Milankovi¢ cycles*’, capture is impossible and the Moon 
crosses the resonance many times in both directions. These numerous crossings 
result in a random walk in lunar eccentricity, within a band in a and e associated 
with the resonance**. Since this band moves to higher e for higher a, it is possible 
to reach arbitrary large eccentricities through this random walk; however, it is 
likely that the Moon will sooner or later reach the outer boundary of the resonant 
band (defined by the resonance location for eg = 0), and exit this chaotic region 
with finite eccentricity, as shown in ref. 38. 

However, it is also possible that through the random walk the lunar orbit will 
evolve to lower eccentricities. This reduction of eccentricity cannot be arbitrarily 
effective: it must be smaller than the thickness of the band in e for the semimajor 
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axis at which the Moon enters the resonant band. Keeping @ and a constant, the 
thickness of the band in e is determined by the variation in eg: 
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where ég,max is the maximum eccentricity of Earth during Milankovic cycles (the 
minimum is zero) and ex and ei, are the boundaries of the band at the 
semimajor axis at which the Moon encounters the resonant band. Entering 


the band with e,,ax, the Moon cannot acquire an eccentricity lower than ein. So if 
we square equation (10) and keep only the quadratic terms in eccentricities, we get 
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Therefore, for eg max = 0.06 (ref. 39), a starting eccentricity of e=0.2 cannot be 
reduced by the jovian evection by more than about 0.03, so the jovian evection 
does not substantially change the implications of Extended Data Fig. 2, namely 
that the lunar eccentricity will probably remain high if it was substantially excited 
by planetesimal encounters when the Moon was already beyond 40Rg. 
Numerical methods. Our dynamics code R-SISTEM (Rotation-Symplectic 
Integrator with Solar Tides in the Earth-Moon system) directly integrates both 
the orbital and rotational motion of the Moon. The Moon is treated as rigid and tri- 
axial, and experiences tidal accelerations, the effects of Earth’s equatorial bulge and 
solar perturbations. The orbital part of the integrator is a symplectic mixed-variable 
integrator based on the principles of ref. 40, with the specific implementation taken 
from ref. 41. The integrator assumes the Moon is on a Keplerian orbit, with all 
other forces (including solar perturbations) inserted as periodic ‘kicks’ in Cartesian 
coordinates. The orbital part of R-SISTEM overlaps substantially with the more 
general-purpose satellite dynamics code SIMPL”, which has been extensively 
tested. Like SIMPL, R-SISTEM can include other planets (or other satellites of 
Earth) as perturbers, but neither were present in any of the integrations included 
in this paper. With no other planets, Earth’s orbit was essentially Keplerian, as 
R-SISTEM ignores back reaction from tides and Moon-Sun interactions on Earth’s 
heliocentric orbit. 

Integration of the Moon's rotation was based on the Lie—Poisson approach of 
ref. 43, with the Moon treated as a tri-axial rigid body torqued by Earth and the 
Sun. This enables direct modelling of the Moon's axial precession and Cassini 
states, and allows for capture of lunar rotation into spin-orbit resonances, such as 
synchronous rotation. One issue we had to deal with is suppression of tumbling 
(that is, Chandler wobble), which can be triggered by some spin-orbit interactions. 
We adopted an approach similar to that of ref. 44, where a torque perpendicular to 
the AM vector acts on the AM vector (in the rotating reference frame) to push the 
AM vector towards the z-axis (the axis of the largest principal moment of inertia). 
The intensity of this torque is adjusted to match the wobble damping timescales 
predicted by ref. 45. If the AM vector is closer to the long axis of the body (that is, 
that associated with the minimum moment of inertia), a similar torque pushes the 
AM vector away from the long axis. These torques have reversed signs in opposite 
hemispheres, so that the wobble damping is the same regardless of the sense or 
rotation (in the body-fixed reference frame). 

Although the rotation of Earth is not resolved in our model, it tracks the 
changes in Earth’s rotation period caused by the tides. The oblateness of Earth 
is adjusted at the end of each timestep, depending on the new value of the spin 
rate. We assumed that J, + we holds for all spin periods (where J, is the standard 
oblateness moment and ws is Earth’s spin rate), which may be inaccurate if 
Earth was very non-spherical owing to fast spin. Earth’s axis is made to precess 
as a result of instantaneous torques by the Moon and the Sun on the equatorial 
bulge (that is, we assume that Earth is purely oblate and in principal axis 
rotation). 

The most important part of the numerical approach is the tides. Cuk and 
Stewart® approximated lunar tides as a radial force counteracting the radial motion 
of the Moon. Although this implementation could match eccentricity damping 
(assuming that the Moon is in synchronous rotation), it is not useful in modelling 
obliquity tides (because a radial force is always in the plane of the orbit). We still 
include a radial lunar tide, but it accounts only for the damping expected from 
the actual, physical radial tide (as opposed to libration and obliquity tides). The 
expression for the radial force used in the integrator is: 
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where r and? are the Moon's geocentric distance and radial velocity. Using the 
expression for r (ref. 39) 


t= 4 _esinf 


V1—e? 


where fis the Moon’s true anomaly, and the expression for the eccentricity damping 
by generalized accelerations is**“° 
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where F, is a tangential acceleration (not present here) and E is the eccentric anom- 
aly. We can then write 
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To the lowest order in eccentricity, this expression averages over an orbital period to 
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This is the analytical expression for the eccentricity damping expected from radial 
tides for a moon in synchronous rotation®’. Although our radial force approximates 
the predicted damping of eccentricity to the lowest order, our approach is funda- 
mentally different from that taken in analytical derivation. Although ref. 39 shows 
that energy loss is expected during a monthly cycle of tidal stresses, our integrator 
‘does not know’ that the cycle repeats monthly. All of the forces acting in our code 
are based only on the current positions and velocities (linear or angular) of the Sun, 
Earth and Moon. Despite these differences in approach, our force form has a clear 
physical meaning: energy is lost when a somewhat inelastic Moon moves radially 
in Earth's tidal field, as the mechanical energy going into elastic deformation turns 
into heat. Our force is strictly radial, so it does not affect the AM of the lunar orbit, 
in agreement with previous treatments of satellite tides*”. 

We have also applied the principle that only the instantaneous quantities can 
be used to calculate tidal forces due to librational and obliquity tides. They are not 
treated differently, but are combined in the same ‘kick; as they both arise from the 
Moon’ orientation moving relative to the Earth-Moon line. An equal and opposite 
torque is applied to the Moon’s spin, enabling despinning and damping into the 
synchronous rotation. Our approach is based on analogy with the general case of 
tides on a non-synchronous body; if we assume that the Moon is rotating much 
faster than it orbits the Earth, the acceleration experienced by the Moon from tides 
raised by Earth on the Moon would be? 
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where ¢ is the lunar spin rate. We may be tempted to apply this equation to the 
synchronous case, and assume that the lag angle is independent of synodic 
frequency (equivalent to having constant Q) so that the angular acceleration simply 
changes sign when the rotation is slower than orbital motion*’. We can estimate 
the effect of such tidal acceleration on eccentricity (with obliquity set to zero), using 
equation (14): 
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For small eccentricities and synchronous rotation, we can assume that 
é- f = — 2encosM, (M being the mean anomaly) so we need to integrate the 
above expression over M= [—7/2, 1/2] with a negative sign, and over 
M=[n/2, 3x/2] with a positive sign. That would give us an expression for 
eccentricity damping that does not depend on eccentricity, which is certainly 
nonphysical. This is because strict constant-Q tides behave like a step function at 
the exact synchronous motion, without any sensitivity to the amount of deviation 
from synchronicity. 

However, if we assume that the tidal Q does depend on frequency, according to 
Q1=Q 126 — f )/n and keep only the lowest terms in eccentricity, we get 
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which is the correct form for the eccentricity damping by the libration tide within 
a synchronous satellite”. Therefore, in order to have a unified treatment of tides 
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in synchronous (or near-synchronous) and non-synchronous cases, we need to 
assume that the lag angle is proportional to the angular velocity of libration ¢ — f 
when the deviation of the rotation rate from orbital motion is smaller than the 
orbital motion, and to assume that the lag angle is constant when the rotation is 
much faster than orbital motion. An alternative of constant time-lag tides (for all 
frequencies) is possible but leads to very large lag angles and very fast tidal 
accelerations when early Earth had a fast spin*’; our even higher AM initial 
conditions would make this problem worse. 

In order to treat tides on the Moon and Earth uniformly, including the obliquity- 
related terms, we implemented the following relationship between the tidal Q 
(inverse of the lag angle) and frequency 


1 1 
= l--=-— = 20 
Q=Q/l+ 5-5 (20) 
with the parameter 6 defined as: 
5=|w— v,|/|v,| (21) 


where ¥; is the tangential component of the perturber’s apparent velocity relative 
to the perturbee’s (that is, the deformed body’s) centre of mass, and the vector w is 
defined as w=s x r, with s being the angular velocity vector of the perturbee, and 
r the radius-vector of the perturber relative to the perturbee. This gives us Q= Qo/é 
for small librations, and Q= Q for a fast-rotating body, and a reasonable transition 
when 6 © 1. The way this lag angle is implemented in the integrator is to place a 
prolate quadrupole moment on the perturbee, with the axis of symmetry defined 
by the vector # + 6(2Q)~! (where hats denote unit vectors). This quadrupole 
moment (described by equation 4.145 of ref. 39) then torques (and is torqued by) 
the perturber, producing tidal accelerations. On the Moon we have only the tidal 
bulge raised by Earth but on Earth we have both lunar and solar tidal bulges, each 
of which torques both the Sun and the Moon; Moon-Sun tidal cross-terms have 
been found to be important!*“*. 

Since we treated all orientations of relative motions between the orbital motion 
and the perturbee’s velocity equally, this formulation does not discriminate between 
(eccentric) libration tides and obliquity tides. Our tests show that the integrator 
correctly reproduces the expected relationship’? 
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Our ‘levelling off’ of the tidal phase lag for libration rates equal to about half 

of the orbital motion has little bearing on eccentricity evolution, given that lunar 
eccentricities in our scenario are rarely above 0.2. However, we do encounter 
large obliquities around the Cassini state transition, which means that we may 
be underestimating inclination damping at high obliquities when compared to 
analytical calculations!®*!. We think that our approach is justified, because stronger 
dissipation at large obliquities would require tidal lag angles much larger than 
that corresponding to the tidal Q in the fast-rotation case, which we treat as the 
upper limit on the lag angle. Further work may be needed to reconcile different 
definitions of tidal Qs in different treatments of high-obliquity tides. 
Laplace plane transition. In Fig. 1, where a fast-spinning Earth’s obliquity was set 
to 70°, we observe the instability associated with the Laplace plane transition when 
the Moon is at 17-18Rg. During this transition the Moon acquires large inclination, 
which is similar to the findings of ref. 26. We find that the behaviour of lunar eccen- 
tricity is rather complex, exhibiting spikes and crashes, separated by relatively stable 
periods of moderately excited eccentricity. To understand these quasi-stable states, 
we looked in detail at the results of the simulation at 12.8 Myr (Extended Data 
Fig. 3). It is clear that the eccentricity is excited by Kozai-type perturbations™*””, 
which are secular and involve interaction between an outer perturber (the Sun 
in this case) and an inclined perturbee (the Moon). Unlike in standard Kozai 
dynamics of planetary satellites**!, Earth’s obliquity has an important role, and 
the inclination oscillates owing to mutual precession of the lunar orbit and Earth's 
spin axis, so that the inclination and eccentricity are not simply anti-correlated. 
Extended Data Figure 3 shows that this dynamical state is periodic in its secular 
behaviour, with exactly three inclination cycles for every eccentricity cycle. 
We hypothesize that each of the stable intervals in eccentricity (Figs 1 and 3, 
and Extended Data Figs 5 and 6) represent a periodic secular state where certain 
precession periods are locked in resonance. 

Extended Data Figure 4 shows a different ‘slice’ of the simulation from Fig. 1 
for the Qg/k2,z = 200 branch of the simulation (black line) at 34.6 Myr. At this 
time, inclination is about 30° and the Laplace plane is dominated by the Sun. The 
eccentricity is still excited, but not by Kozai-type perturbations. The source of 
eccentricity excitation is near-resonant perturbations stemming from the slow- 
varying argument Y= 32+ 2w — 37, where 2 and y are the longitudes of the 


lunar ascending node and Earth's vernal equinox, respectively, and w is the Moon's 
argument of perigee. This interaction term combines solar perturbations and the 
precession of Earth’s spin axis, so it is capable of changing the obliquity of Earth. 
Since this interaction does not require capture into resonance, it is probably less 
sensitive to random perturbations such as planetesimal encounters® than a narrow 
resonance would be. We hope to study the dynamics of this and similar near- 
resonant terms in future work. 

Extended Data 5 shows the evolution of the Earth-Moon system with the 
same initial conditions as in Fig. 1, but with Qg/k2,5 = 200 from the beginning 
of the simulation. The black line plots the case with Qyy/k2,m = 200, and the grey 
line represents Qyy/k2,q =50. We also ran a simulation with Qyi/ko4 = 100, and it 
was in all ways intermediate between these two. Overall, the simulations shown in 
Extended Data Fig. 5 have low final obliquities, but have an excess of AM compared 
with the real Earth-Moon system. We conclude that higher Qg during the early 
part of the Laplace plane transition leads to less AM loss than in the case shown in 
Fig. 1, but that it leads to larger reduction in obliquity during the later part of the 
simulation. Lunar tidal properties appear to be less important, with a more dissi- 
pative moon leading to slightly more AM loss and lower final obliquities for Earth. 

In Extended Data Fig. 6 we started Earth with a spin period of only 2h (as 
opposed to 2.5h in Fig. 1). Just like in Fig. 1, we changed the tidal properties 
of Earth halfway through the simulation, leading to two different outcomes 
(see legend to Extended Data Fig. 6). While the qualitative evolution of the system 
is similar to that shown in Fig. 1, there are some qualitative differences. The final 
obliquity is lower than in Fig. 1 (in excellent agreement with ref. 27), while the final 
AM is about 15% too large. Given that the trend in outcome we see with increasing 
initial AM (lower final obliquity and higher final AM) is in the opposite direction 
from the trend we see with increasing initial obliquity (higher final obliquity and 
lower final AM), it is likely that a more complete exploration of initial conditions 
will find higher-AM cases where both the final AM and obliquity are satisfactory. 

The apparent threshold obliquity for the instability that we find (between 60° 
and 65°) is below the critical obliquity of about 69° found by ref. 23; we understand 
that this is because of non-zero initial inclination of the Moon when encountering 
the Laplace plane transition. In all of our simulations, the Moon acquires a few 
degrees of inclination when crossing the inclination resonance just interior to the 
evection®; this early resonance is weak at low obliquities but strong when Earth's 
spin axis is tilted. As long as we assume a hydrostatic shape for the Moon, the 
Moon's obliquity is low and obliquity tides are weak, so this inclination largely 
survives until the Laplace plane transition. 

During the Laplace plane transition a large amount of energy is dissipated in 

the Moon. For example, during the evolution shown in Fig. 1, tidal heating in 
the Moon reaches 10'*-10'° W for several tens of millions of years. Owing to the 
prolonged nature of both the transition and subsequent cooling of the lunar mantle, 
the lunar crust would contain the signal of major thermal events that occurred tens 
of millions of years after lunar accretion; thus, the Laplace plane transition should 
be considered when interpreting the geochronology of lunar samples. 
Cassini state transition. The rotation states of most large planetary satellites are 
highly evolved through tidal forces raised by their parent planet. The spin periods 
of satellites are typically synchronized with the orbital period, and this is true for 
Earth’s Moon. Apart from spin period synchronization, tidal forces also damp any 
initial, so-called ‘free, obliquities of the moons. The most energetically stable final 
state for satellite obliquities is one where the spin axis maintains a constant angle to 
the moon’s orbital plane. Since both the satellite's spin axis and the orbital plane are 
precessing (the spin axis precesses around the orbital plane, which precesses around 
the Laplace plane), the most stable final obliquity is one matched to inclination 
and the precession rates in such a way that the spin axis, orbit normal and Laplace 
plane normal all stay in the same plane during their precession. This arrangement is 
called a Cassini state, and Cassini states can be calculated using equation (7)**. The 
two solutions relevant here are Cassini state 1, which the Moon will occupy when 
its spin precession rate is faster than the orbital precession, and Cassini state 2, 
which is the only possible solution when the spin precession is slower than orbital 
precession (as is the case at the present day). The Moon is thought to have crossed 
the Cassini state transition at about 33R_z when Cassini state 1 ceased to exist and 
the Moon had to shift to Cassini state 2 (ref. 18). 

The largest discrepancy between our R-SISTEM integration of the Cassini state 
transition (Fig. 4) and the semi-analytical solutions (Extended Data Fig. 1) is the 
fact that the Moon is in non-synchronous rotation from 29.7Rg to about 35R¢ in 
the numerical simulation. To explore this phenomenon in more detail, we ran 516 
short simulations with initial conditions on a grid in a and i covering the Cassini 
state transition (Extended Data Fig. 7). We found that the Moon settles into a stable 
sub-synchronous rotation for a wide range of initial conditions with a= 27-37Rg. 
In all of the grid simulations, the Moon was initially non-synchronous, so our 
results show when the synchronous rotation will not be re-established after 
being broken, rather than that the synchronous rotation is unstable, in the 
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regions covered with orange crosses in Extended Data Fig. 7. When it comes to 
intrinsic stability of continued synchronous rotation, we find major differences 
between Cassini states 1 and 2. Cassini state 1 is by itself stable all the way until its 
disappearance at a= 31-34Rg (the exact distance depends on inclination), unless 
disturbed by an outside influence (for example, a wobble resonance or an impact). 
On the other hand, we find that Cassini state 2 is intrinsically unstable if its equilib- 
rium obliquity is above 58.15 (Extended Data Fig. 8), a result well established in the 
literature°*°?, but not previously relevant for the Moon in the low-inclination case. 
We also find that once Cassini state 2 becomes stable, the Moon may still occupy 
a sub-synchronous rotation state just short of synchronous until it is disturbed by 
a resonance or an impact. 

We also find a number of resonances that excite the Moon's rotation and can 
lead to long-term chaos (red crosses in Extended Data Fig. 7). Some of these are 
probably associated with secondary resonances found by ref. 54. Here we will 
concentrate on three features seen in Fig. 4: the wobble resonance at 29.7°, the 
1:2 spin-orbit resonance at 31° and several resonances at 34—35°. The wobble 
resonance at 29.7° breaks the synchronous rotation by inducing large non-principal 
axis rotation within the Moon. Extended Data Fig. 9 shows how the wobble 
amplitude grows as the Moon approaches the resonance (at a tidal evolution rate 
accelerated 100 times), and Extended Data Fig. 10 shows the resonance crossing 
itself, integrated at the nominal tidal evolution rate. This resonance is caused by 
the commensurability between Earth’s heliocentric motion and the lunar libration 
in longitude, the frequency of which is given by ref. 55 
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Currently, the period of the libration in longitude is 1,056.1 days (ref. 55). As long 
as the Moon’s shape is constant, this period is proportional to the orbital period, 
so the semimajor axis at which this period is one year is 


(23) 
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which is exactly where it occurs in our numerical simulation. After the wobble 
disrupts synchronous rotation, the Moon settles into a sub-synchronous 
high-obliquity state, consistent with Extended Data Figs 7 and 8. 

At about 31Rg, the Moon is briefly captured into a 1:2 spin-orbit resonance. 
This resonance can be stable at high eccentricities**, but we never observed long 
periods of capture in our high-obliquity, low-eccentricity cases. At 34-35Rg the 
Moon encounters several resonances which disrupt sub-synchronous rotation 
and the Moon settles into (the by now stable) Cassini state 2. We think that these 
resonances are related to the 1:3 secondary secular resonance, which was found 
to intersect with Cassini state 2 at about this geocentric distance”. Clearly more 
work is needed to identify these resonant features and explore the full diversity 
of the Moon's past spin-orbit dynamics. Here we assumed that the Moon already 
had its present shape at the time of Cassini state transition, which is a reasonable 
assumption”, but other shape histories are possible. Also, beyond 30Rx in Fig. 4, 
lunar tidal evolution is accelerated 10 times over nominal, and the Moon would 
probably be affected by some of the later resonances more strongly if the evolution 
was integrated at the nominal rate. Additional factors that we ignored are impacts 
that can disrupt the Moon's rotational state*® and the core—-mantle interaction, 
which may have been important in generating the ancient lunar magnetic field*”. 

Once the Cassini state is re-established in Fig. 4, we can compare our numerical 
results with analytical estimates (see equations (1)—(6)). Our numerical code 
R-SISTEM damps lunar inclination from 16° at 35Rg to 9.2° at 40Rg. This is 
slightly below our nominal target of 10° at 40Rg (based on Extended Data Fig. 1), 
obtained by assuming a long-term-average Earth and the present Moon beyond 
40Rg (Extended Data Fig. 1). Also, R-SISTEM damps inclination slightly faster 
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for Quo = 60 than does the analytical model (switching to the analytical approach 
at 35Re would give us 10.9° at 40Rg). This is an inevitable consequence of the 
differences between the two approaches, but we are encouraged by the over- 
all convergence of the results. In the absence of any strong constraints on the 
timeline of the Moon's tidal evolution, we think that the history shown in Figs 1 
and 4 represents the best currently available explanation for the otherwise puzzling 
lunar orbital inclination. 

Code Availability. Computer programs used in this paper are available on request. 
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Extended Data Figure 1 | Semi-analytical model of the lunar tidal 
evolution. Evolution of lunar inclination (a) and obliquity (b) as the 
Moon evolves from 25Rg to 60Rz using our semi-analytical model 
(Methods subsection ‘Damping of lunar inclination by obliquity tides’). 
Initial inclinations were chosen so that the final lunar inclination was the 
current value of about 5°, while the obliquity was calculated assuming 

the Moon was in a Cassini state (jumps between 30Rz and 35R, are due to 
the transitions between Cassini states 1 and 2)!*. Love numbers were set at 
their current values (kop = 0.3, ko, =0.024)°°, and the current lunar shape 
was assumed. The black and red lines plot the solutions for Qu = 10,000 
and Qy = 38 (current value), respectively, while Qz was in the range 33-35 
range (it was adjusted so that a semimajor axis of 60R_ was reached after 
4,500 Myr). The blue line plots a history assuming Quy = 100 interior to 
40Rz, and Quy = 38 after the Moon passes that distance. The black line 
closely resembles the results of studies!!!” that neglected lunar obliquity 
tides, while the other two curves indicate that the past lunar inclination 
must have been much larger owing to lunar obliquity tides. 
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Extended Data Figure 2 | Lunar tidal evolution following planetesimal 
encounters. Evolution of lunar inclination (a) and eccentricity (b) 
following the excitation of the lunar inclination by encounters with 
planetesimals as proposed by ref. 3, using our semi-analytical model. 

The two sets of initial conditions are for the state a= 47Rz, i=5.8° featured 
in ref. 3 (Fig. 1), and a possible outcome with a more excited inclination 
i= 10° (also at a=47Rg). The initial eccentricities were estimated as 

e = 2sini. The black lines show the evolution assuming Qu = 38 and 

Q: = 34, with the current Love numbers. The red line plots the evolution 
for Quy = 100 and the blue line the evolution for Qg = 20. The circle 
symbol plots the current inclination and eccentricity of the lunar orbit. 
No combination of tidal parameters can simultaneously match both the 
current lunar inclination and eccentricity at the same time. A small 

Qm combined with high e also keeps the Moon from reaching 60Rg 

(top two lines). Decreasing Qg also does not help, as stronger Earth tides 
further increase the lunar eccentricity (blue line). 
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Extended Data Figure 3 | A snapshot of the simulation shown in 

Fig. 1 taken at 12.8 Myr. The left-hand panels show eccentricity (a), lunar 
inclination and Earth's obliquity with respect to the ecliptic (b), and the 
argument of perigee of lunar orbit, with the ecliptic as fundamental plane 
(c) versus time over a 500-year period, while the right-hand panels plot 
lunar eccentricity (d) and inclination (e) against the Moon’s argument of 
perigee (over the whole period of 30,000 yr). The eccentricity is clearly 
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correlated with the argument of perigee, as expected for Kozai-type 
perturbations***’. Rapid mutual precession of Earth’s spin axis and the 
plane of the Moon’s orbit, which are substantially inclined to one another 
(and to the ecliptic plane), clearly affects both the eccentricity and the 
perigee precession. This secular behaviour is periodic, with exactly three 
inclination cycles per one eccentricity cycle, which corresponds to half of 
the period of precession of the argument of perigee. 
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Extended Data Figure 4 | A snapshot of Qz/k2,5 = 200 simulation shown 
in Fig. 1 (black line) at 34.6 Myr. At this time, eccentricity excitation (a) 
and the associated variation of Earth’s obliquity (b) are not due to Kozai 
perturbations, but owing to the slow-varying near-resonant argument 

W= 30+ 2w — 3y(c), where 92 and y are longitudes of the lunar ascending 
node and Earth's vernal equinox, respectively, and w is the Moon’s 
argument of perigee. This near-resonant interaction is responsible for the 
substantial reduction of Earth's obliquity seen in Fig. 1. 
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Extended Data Figure 5 | Early tidal evolution of the Moon with 
Qe/k2,5 = 200 throughout the simulations. Black lines are for the case 
with Qu/ko,m = 200, while grey lines plot the simulation with Qg/k2,z = 50. 
The most notable aspects of these simulations are low final obliquities of 
Earth (e) and a final AM of the Earth-Moon system (b) in excess of the 
current value of 0.35a,, |GM3Rx. a, cand d plot the semimajor axis, 
eccentricity and inclination of the Moon. 
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Extended Data Figure 6 | Early tidal evolution of the Moon with Earth 
initially having a 2-h spin period. This is equivalent to the system having 
twice the current AM. The grey lines plot a simulation in which the tidal 
properties of Earth and the Moon were Qg/k2.5 = Qu/k2.m = 100 
throughout. The black line shows a simulation branching at 30 Myr 

by changing Qz/k2x to 200. Although the final obliquity of Earth (e) is 
correct, the final AM of the Earth-Moon system (b) somewhat exceeds 
the current value of 0.35  ag,/J GM3Re. a, cand d plot the semimajor axis, 
eccentricity and inclination of the Moon. 
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Extended Data Figure 7 | Map of lunar rotational dynamics close to the 
Cassini state transition. Outcomes of 512 simulations probing the end 
states of initially very fast and very slow lunar rotations for 16 different 
lunar semimajor axes a and 16 different lunar inclinations i. Simulations 
were run for 1 Myr, except for the rightmost three columns, which were 
followed for 3 Myr. Each a—i field is described by two symbols, one each 
for initial rotations of 127 rad yr~! and 381 rad yr !. Green and blue 
boxes indicate synchronous rotation in Cassini states 1 and 2, respectively. 
Crosses indicate non-synchronous rotation with stable obliquity, with large 
orange crosses indicating sub-synchronous rotation, and small magenta 
crosses plotting super-synchronous states. Red crosses signify variations 
in obliquity above 1° during the last 50 kyr of the simulation (indicating 
excited or chaotic spin axis precession). 
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Extended Data Figure 8 | Lunar obliquity close to the Cassini state 
transition. Obliquities for four ‘slices’ in inclination (at 5°, 10°, 15° and 
20°) from the grid of short simulations shown in Extended Data Fig. 7 
(solid red and magenta lines with points; the obliquities and inclinations 
are in the same order at far left and far right). When two different 
simulations for the same a and i differed in outcome, we chose the solution 
within the Cassini state, if available. The blue dashed lines plot the relevant 
Cassini states calculated using analytical formulae, while the black dashed 
line at 58.15° plots the upper limit for stable obliquities in the relevant 
Cassini state. Although the numerical and analytical results agree at the 
smallest and largest semimajor axes, the large discrepancies in between 

are due to non-synchronous rotations being dominant at the Cassini state 
transition. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


LETTER 


Lunar libration angle (°) NPA spin (rad/year) 


Semimajor axis (R_) 


Extended Data Figure 9 | The Moon’s wobble as it approaches the 
annual resonance in Fig. 4. The rotation rate around the longest axis 

of the Moon (a) and the angle between the longest axis and Earth (b) 
during the first phase of lunar tidal evolution (red points in Fig. 4) within 
29.7Rx, where we accelerated the tidal evolution by a factor of a hundred. 
The wobble is clearly building up as the Moon is approaching the 
resonance between its free wobble and Earth's orbital period at about 
29.7Rg. The growth in lunar libration angle is more influenced by 
increasing lunar obliquity (Fig. 4b) than the increase in the amplitude 

of the wobble. 
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Extended Data Figure 10 | Passage through the annual resonance of the lunar 
free wobble in Fig. 4. Lunar obliquity (a), spin rate (b), rotation rates around the 
longest and shortest principal axes (c), and the angle between the Moon's longest 
axis and Earth (d) during the first 1 Myr of the ‘blue’ segment of lunar tidal 
evolution in Fig. 4 (which was simulated at the nominal rate for tidal evolution). 
The free wobble (tracked by grey points in c) experiences a resonance at about 
330 kyr, breaking the Moon’s synchronous rotation. Since the Moon is close to the 
Cassini state transition, it cannot evolve back into Cassini state 1 and it settles into 
a non-synchronous high-obliquity state*®. 
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Nanoscale thermal imaging of dissipation in 


quantum systems 


D. Halbertal!, J. Cuppens!*, M. Ben Shalom?, L. Embon!+, N. Shadmi®, Y. Anahory’, H. R. Naren!, J. Sarkar!, A. Uri!, Y. Ronen!, 
Y. Myasoedov!, L. S. Levitov®, E. Joselevich®, A. K. Geim** & E. Zeldov! 


Energy dissipation is a fundamental process governing the dynamics 
of physical, chemical and biological systems. It is also one of the 
main characteristics that distinguish quantum from classical 
phenomena. In particular, in condensed matter physics, scattering 
mechanisms, loss of quantum information or breakdown of 
topological protection are deeply rooted in the intricate details of 
how and where the dissipation occurs. Yet the microscopic behaviour 
of a system is usually not formulated in terms of dissipation because 
energy dissipation is not a readily measurable quantity on the 
micrometre scale. Although nanoscale thermometry has gained 
much recent interest!~'’, existing thermal imaging methods are 
not sensitive enough for the study of quantum systems and are 
also unsuitable for the low-temperature operation that is required. 
Here we report a nano-thermometer based on a superconducting 
quantum interference device with a diameter of less than 50 
nanometres that resides at the apex of a sharp pipette: it provides 
scanning cryogenic thermal sensing that is four orders of magnitude 
more sensitive than previous devices—below 1 1K Hz~””. This non- 
contact, non-invasive thermometry allows thermal imaging of very 
low intensity, nanoscale energy dissipation down to the fundamental 
Landauer limit!*!* of 40 femtowatts for continuous readout of a 
single qubit at one gigahertz at 4.2 kelvin. These advances enable the 
observation of changes in dissipation due to single-electron charging 
of individual quantum dots in carbon nanotubes. They also reveal a 
dissipation mechanism attributable to resonant localized states in 
graphene encapsulated within hexagonal boron nitride, opening the 
door to direct thermal imaging of nanoscale dissipation processes 
in quantum matter. 

Investigation of energy dissipation on the nanoscale is of major 
fundamental interest for a wide range of disciplines, ranging from 
biological processes, through chemical reactions, to energy-efficient 
computing! >. Study of dissipation mechanisms in quantum systems 
is of particular importance because dissipation demolishes quantum 
information. In order to preserve a quantum state, the dissipation 
has to be extremely weak and hence hard to measure. As a figure of 
merit for detection of low-power dissipation in quantum systems’®, 
we consider an ideal qubit operating at a typical read-out frequency 
of 1 GHz. Landauer’s principle states that the lowest bound on energy 
dissipation in an irreversible qubit operation is Ej =kgT1n2, where kg 
is Boltzmann’s constant and T is the temperature!”®, At T=4.2K, 
Ey=4 x 10~”°J, several orders of magnitude below both the 107!°J 
of dissipation per logical operation in present-day superconducting 
electronics and the 10~'°J in CMOS devices!”°. Hence the power 
dissipated by an ideal qubit operating at a read-out rate of f= 1 GHz 
will be as low as P= Eof = 40.2 fW. The resulting temperature increase 
of the qubit will depend on its size and the thermal properties of the 
substrate. For example, a 120nm x 120 nm device on a 1-j1m-thick 
SiO2/Si substrate dissipating 40 fW will heat up by about 3 1K (Fig. 1). 
Such signals are several orders of magnitude below the best sensitivity 


(several mK Hz") of any of the existing imaging techniques!!° 


(Fig. 1a): these include radiation-based thermometry using infrared® 
(IR), fluorescence in nanodiamonds*”*, Raman spectroscopy’, scan- 
ning near-field optical microscopy'® (SNOM), or the electron beam 
induced plasmons in transmission electron microscopy® (TEM), 
and atomic force microscopy (AFM) equipped with thermocouple 
(TC SThM) or resistive (Res. SThM) thermometers!°"!5. Moreover, 
none of the existing imaging techniques has been demonstrated to 
operate at the low temperatures that are essential for study of quantum 
systems. 

Superconducting junctions are commonly used as highly sensitive 
thermometers””’, relying on the strong temperature dependence of their 
critical current I(T), which in the vicinity of the critical temperature 
T- can be approximated by [.(T) © Io(1 — T/T). Conventional junc- 
tion configurations, however, are not suitable for scanning probe 
thermometry owing to their planar geometry and strong thermal 
coupling to the substrate. 

Here we introduce a non-contact cryogenic scanning probe micros- 
copy technique based on a superconducting quantum interference 
device (SQUID) positioned on a tip”””’. We utilize a novel approach in 
which either a single Pb (7. = 7.2 K) superconducting junction or a 
SQUID is fabricated on the apex of a sharp quartz pipette (SQUID on 
tip, SOT), and can be made as small as a few tens of nanometres in 
diameter (Supplementary Information section $1). Figure 1b shows a 
scanning electron microscopy (SEM) image of a thermometer using 
this SQUID-on-tip technique (that is, a SOT) with an effective 
diameter of 46nm, as determined from its quantum interference 
pattern. Similar SOT devices have previously been reported” to be 
extremely sensitive magnetic sensors, reaching a magnetic spin 
sensitivity below 0.4 j1y Hz”. The electrical characteristics of the SOT 
(Fig. 1c) show the current through the sensor (Iisor) versus externally 
applied bias current (Ipias) at various temperatures (see electrical diagram 
in Supplementary Fig. 1c). For sor < I, essentially all the applied 
current flows through the tSOT, while at higher bias a substantial part 
of the current diverts to a parallel shunt resistor. When biasing the SOT 
at Ibias > I- (dashed line in Fig. 1c), the temperature dependence of 
Iisor(T) gives rise to a thermal response diisor/dT = —9.5 1A K7! 
(Supplementary Fig. 1b); when this is combined with the very low white 
noise of the tSOT ($}/?=8.3 pA Hz~'/?; Supplementary Fig. 1c) it 
translates into a remarkably low thermal noise of S a *—870nK Hz" 1/7, 
an improvement of four orders of magnitude over existing thermal 
imaging methods (Fig. 1a). 

To enable effective thermal imaging, the thermal properties of the 
sensor, including its coupling to the sample, are crucial. For non- 
invasive imaging, the thermal resistance between the sensor and the 
investigated device, Rsq, has to be considerably larger than the thermal 
resistance between the device and the bulk of its substrate, Ray (Fig. 1d). 
On the other hand, in order for the temperature of the sensor T; to 
accurately describe the local temperature of the device Tj, a high 
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Figure 1 | tSOT characteristics and performance. a, Sensitivities (left- 
hand axis) of different thermal imaging techniques (blue) and of the tsOT 
(red) versus their spatial resolution (bottom axis). Green diamond, the 
measured temperature increase (right-hand axis) due to 40 fW dissipation 
(taken from e) corresponding to Landauer’s limit for qubit operation 

at 4.2 K at 1 GHz along with the theoretical scaling of the temperature 
(right-hand axis) with the qubit size (dashed line, top axis). See main text 
for definitions of the thermal imaging techniques. b, SEM image of the 
46-nm effective diameter Pb tSOT. c, Electrical characteristics of the tSOT 


thermal resistance R,; is required between the sensor and its support 
structure. The resulting overall requirement of R,, >> Rsa >> Rap is 
usually hard to achieve in AFM-type scanning thermal probes!”, 
leading to invasive in-contact imaging!*“. In a tSOT, in contrast, R,. is 
extremely high, owing to the unique nanoscale cross-section geometry 
of the device, giving rise to a quantum-limited phonon thermal 
conductivity”, and to the absence of electronic heat conductivity along 
the superconducting leads. As a result, our smaller tSOT attains 
Rs 10'!K W7! (see Supplementary Information section $6) as 
compared to Ray 107K W“? for a 120nm x 120nm device on a 
SiO,/Si substrate at 4.2 K (Fig. le). The corresponding optimal Req of 
108 to 10!°K Wis readily achieved in our configuration using a few 
mbar of He exchange gas to tune Rsq (Supplementary Fig. 8). These 
features permit non-contact sensing of the Ty of the sample with 
nanoscale resolution (Supplementary Fig. 2 and Supplementary 
Information sections S3, $4). 

To characterize the thermal sensitivity of the scanning tSOT, we 
position the sensor above a 120-nm-wide Cu nanowire on a SiO)/Si 
substrate (Fig. 1d and Supplementary Fig. 7); the nanowire is carrying 
an alternating current (a.c.), which results in an a.c. temperature 
modulation 7, of the nanowire at 13.1 kHz. By changing the current 
amplitude, we measured tSOT T,. versus the power P dissipated per 
square 120nm x 120nm segment of the wire (Fig. le). Since T,,. < Ty, 
where T;, = 4.2 K is the thermal bath temperature, the measurement is 
in the small signal limit and hence T,, is linear in P as expected, reaching 
a noise level of T,-+440 nK at P= 6fW. The green triangle in Fig. le 
shows the Landauer dissipation limit of P= 40.2 fW ofa qubit operating 
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for temperatures ranging from 4.2 K to 7.2 K with marked values of the 
critical current J, at representative temperatures. d, Schematic drawing 

of the measurement set-up and a simplified effective thermal circuit. 

See main text for definitions of symbols. e, Measurement of the tSOT 
temperature T,. at 13.1 kHz above a 120-nm-wide Cu nanowire with sheet 
resistance 0.46 () per square versus the a.c. power P dissipated by a variable 
I,, in a 120nm x 120nm unit segment of the nanowire. The dashed line 
shows a linear fit (with slope Ra, =6.8 x 10’ K W~!) anda noise floor of 
Tac 440 nK below P= 6 fW using a lock-in amplifier time constant of 1s. 


at 1 GHz at 4.2 K, which gives rise to Ta, =2.6 1K in our sample. This value 
is indicated in Fig. 1a (green triangle) along with the expected inverse 
scaling of the qubit temperature with its area (dashed line). 

By applying a proper combination of magnetic field and bias, the 
tSOT can be tuned to have both magnetic field and thermal sensitivities. 
As the Oersted field generated by the transport current in the sample 
is linear in current while the dissipation is quadratic, the resulting mag- 
netic field signal B%* generated by the a.c. current J,. at frequency f can 
be imaged simultaneously with the thermal signal T,, that will be 
present at frequency 2f, as shown in Supplementary Fig. 11. 

We used the tSOT for nanoscale thermal imaging of quantum matter. 
Figure 2a, b shows 7,, images of two single-walled carbon nanotubes 
(CNTs) carrying an alternating current of few nA. Each CNT is wound 
into a loop”®, as outlined by the dotted trajectories and shown by SEM 
images in Fig. 2d, e. The thermal signal in Fig. 2a tracks the CNT, 
revealing the current-driven dissipation along the entire length of 
the CNT. Surprisingly, and in contrast to the above, Fig. 2b shows 
an absence of heating in the circular part of the loop. The T,, image 
thus reveals that the applied current bypasses the loop, exposing an 
electrically shorted junction between the two crossing sections of the 
CNT. This observation illustrates the capacity of the tSOT for fault 
detection in operating nanodevices. 

A striking feature evident in Fig. 2a, b is ring-like fine structure: a 
zoomed-in T,, image of one of these is shown in Fig. 2c. These features 
resemble the Coulomb blockade rings observed in scanning gate 
microscopy”, in which the conductance through a quantum dot is 
measured as a function of the position of a conducting AFM tip. In the 
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Tae (UK) Figure 2 | Thermal imaging of single-walled 
260 330 400 CNTs and scanning gate thermometry of 
LE 


quantum dots. a, b, Thermal images of two 
CNT devices with loop geometry carrying I< 
of 12 nA (a) and 3 nA (b), revealing an electrical 
short at the loop intersection point in b. The 
colour scale used for T,, is shown at the top 
of each image. The T,,(x, y) was acquired by a 
tSOT of 104-nm diameter at scanning heights 
of about 65 nm (a) and 150 nm (b). The ring- 
like structures in T,, result from variations in 
dissipation due to modulations in the single- 
electron charging of individual quantum dots 
as described for c. c, Zoomed-in scanning 


250 


gate thermometry image of a single quantum 
dot in the area boxed in d at a scan height of 
about 35 nm (7), scale at top). Variations in 

the electrochemical potential of the quantum 
dot induced by the scanning tSOT give rise 

to changes of the temperature of the dot at 

the Coulomb blockade peak conditions along 
the equipotential ring-like contour. d, e, SEM 
images of the devices in a and b. Boxed area in 
e is shown in Supplementary Fig. 15. f, Line-cut 


case of scanning gate microscopy, the characteristic equipotential rings 
(corresponding to the periodic conditions of Coulomb blockade peaks 
governed by single-electron charging of the quantum dot’) originate 
from the tip acting as a local gate. Since the tSOT is conducting, it 
can also serve as a nanoscale scanning gate. The disordered substrate 
potential in our long CNTs on the SiO./Si substrate gives rise, however, 
to carrier localization and formation ofa series of electronic quantum 
dots, resulting in CNT resistances in excess of 10 MQ. As a result, no 
detectable change in CNT conductance is observed while scanning the 
tSOT. The unique feature of the tSOT, however, is that in addition to 
functioning as a scanning gate it simultaneously operates as a nanoscale 
thermometer that detects minute changes in the local dissipation result- 
ing from its own controllable local gating. The 7, ring in Fig. 2c witha 
cross-sectional width of 20nm (Fig. 2f) thus reveals microkelvin-range 
changes of the temperature of a quantum dot resulting from the mod- 
ulation of its single-electron Coulomb blockade conductance peaks by 
the scanning tSOT (see Supplementary Information sections $13, S14 
for additional details). This novel ‘scanning gate thermometry’ thus 
adds another functionality to the tSOT, allowing nanoscale manipula- 
tion of the potential and study of the induced changes in the local scat- 
tering processes and dissipation that are inaccessible by other methods. 

One of the topics of great interest in the study of electronic transport 
at low temperatures is the non-equilibrium heating of the electron bath 
and the proliferation of hot carriers due to poor electron-phonon 


across the line marked on right in c. 
X (nm) 


coupling. Graphene offers a unique system in which the typical 
distance for such carrier-lattice cooling can exceed the dimensions of 
the device** owing to the anomalously long lifetime of the hot carriers”. 
Electron cooling rates are believed to be sharply enhanced in the 
presence of disorder, but the precise mechanism of such enhancement 
is poorly understood”. 

To probe microscopic dissipation mechanisms in graphene, we 
performed scanning thermometry in high-mobility graphene encap- 
sulated in hexagonal boron nitride (hBN). We used a washer-shaped 
device to which a direct current was applied between top and bottom 
constrictions, as illustrated in Fig. 3a (see Supplementary Information 
section $12 for details). Figure 3b shows the corresponding scanning 
gate thermal image, revealing a startling visualization of dissipation 
processes, which are manifested in a complex structure of two ‘neck- 
laces’ of rings along the inner and outer edges of the device. The sharp 
ring-like structures have the same origin as the ring-like patterns in the 
CNTs in Fig. 2a—c, and reveal the presence of localized resonant states 
at the edges of the graphene; these states are acting as atomic-scale heat 
beacons. The tSOT functions as a top gate, which tunes the potential of 
the localized states into resonance when the combination of its distance 
and the voltage Visor applied between the tip and the sample match 
the resonance conditions. This gives rise to sharp rings of enhanced 
temperature, as described in detail in Supplementary Information 
sections $13 and S15. Consequently, the ring radius R changes 


Figure 3 | Scanning gate thermometry of 
dissipation at localized resonant states at 
graphene edges. a, Optical image of 
hBN/graphene/hBN structure patterned into 

a washer shape (bright) with inner and outer 
diameters of 2 1m and 6m, respectively. Arrows 
show where the direct current Ig, was applied. 

b, Scanning a.c. gate thermometry 7) image of 
the boxed area in a in the presence of Ig, =6 1A 
at a carrier concentration of 10!*cm~?; the T,, 
colour scale is shown at right. The T,<(x, y) 

was acquired by a tSOT of 100-nm diameter 

at 4.2 K at a height of 40 nm in the presence 

of an a.c. potential of 200 mV applied to the tip 
(see Supplementary Information sections 
$13-S15). The dashed lines mark graphene edges. 
The ‘necklace’ of rings reveals the presence of 
resonant states along the edges of graphene that 
serve as local centres of energy dissipation. 
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on varying Visor, as demonstrated in Supplementary Fig. 16 
and Supplementary Video S1, or on changing the tip height, as 
demonstrated in the case of a CNT in Supplementary Fig. 15. 

Despite the similarity between the ring-like structures observed 
in CNTs and in graphene, their microscopic origin appears to be 
different. For CNTs, the disorder potential creates quantum dots that 
confine discrete electronic states. Since the conductance of the quan- 
tum dot is sharply enhanced under resonant conditions, the Coulomb 
blockade staircase gives rise to the sequence of concentric rings of 
enhanced local temperature in the presence of current flow through 
the CNT (Supplementary Fig. 15). In our hBN/graphene/hBN 
devices, in contrast, the hot electrons flow mainly in the bulk of the 
graphene, while electron-lattice cooling occurs predominantly at 
defect states at graphene edges. Such defects can be formed by the 
vacancies and adatoms produced at the exposed graphene edges dur- 
ing the etching process. Spatially localized electronic states originat- 
ing from such defects, with energies pinned to the Dirac point*!, were 
recently revealed by scanning tunnelling microscopy*’. Localized 
states can considerably enhance electron-phonon coupling, thus pro- 
viding a local drain for the excess energy of the hot electrons*°. Since 
each defect creates a single resonant electronic state*!, only one ring 
should be observed around each defect with radius R that depends 
on Visor as shown in Supplementary Fig. 16 and Supplementary 
Video S1. Acting as gate-tunable spatially localized energy flow 
bottlenecks, these dissipation centres are likely to play a dominant role 
in the hot-carrier applications of graphene electronics. These obser- 
vations shed new light on electron-lattice cooling mechanisms in 
graphene. 

The observation of striking, spatially localized dissipation centres 
at graphene edges underscores the potential of the tSOT technique 
for uncovering the microscopic origins of dissipation in novel states 
of matter. Other systems of interest are topologically protected 
surface states, edge states in quantum spin and anomalous quantum 
Hall systems, and surface states in Weyl semimetals. By choosing 
appropriate superconducting materials””?>~°, it should be possible 
to extend the operating temperature range of the tSOT from tens of 
millikelvin to tens of kelvin (Supplementary Fig. 1d), thus allowing 
investigation of a wide range of quantum systems. In addition, the 
operation of the tSOT at elevated magnetic fields combined with 
the multifunctional abilities of magnetic sensing and scanning gate 
thermometry opens the way to nanoscale investigation and imaging 
of intricate thermoelectric and thermomagnetic phenomena. These 
include the Nernst effect, the thermal Hall effect, thermoelectric 
nanoscale cooling and quantum heat conductance. 
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Intrinsically stretchable and healable semiconducting 
polymer for organic transistors 


Jin Young Oh!*, Simon Rondeau-Gagné!*+, Yu-Cheng Chiu!**+, Alex Chortos!, Franziska Lissel!, Ging-Ji Nathan Wang!, 
Bob C. Schroeder!+, Tadanori Kurosawal, Jeffrey Lopez', Toru Katsumata!”, Jie Xu!, Chenxin Zhu’, Xiaodan Gu'*, Won-Gyu Bael, 
Yeongin Kim?, Lihua Jin°+, Jong Won Chung", Jeffrey B.-H. Tok! & Zhenan Bao! 


Thin-film field-effect transistors are essential elements of 
stretchable electronic devices for wearable electronics’. All of the 
materials and components of such transistors need to be stretchable 
and mechanically robust**. Although there has been recent progress 
towards stretchable conductors*°, the realization of stretchable 
semiconductors has focused mainly on strain-accommodating 
engineering of materials, or blending of nanofibres or nanowires 
into elastomers”"|!. An alternative approach relies on using 
semiconductors that are intrinsically stretchable, so that they can 
be fabricated using standard processing methods’. Molecular 
stretchability can be enhanced when conjugated polymers, 
containing modified side-chains and segmented backbones, are 
infused with more flexible molecular building blocks!*4, Here we 
present a design concept for stretchable semiconducting polymers, 
which involves introducing chemical moieties to promote dynamic 
non-covalent crosslinking of the conjugated polymers. These 
non-covalent crosslinking moieties are able to undergo an energy 
dissipation mechanism through breakage of bonds when strain 
is applied, while retaining high charge transport abilities. As a 
result, our polymer is able to recover its high field-effect mobility 
performance (more than 1 square centimetre per volt per second) 
even after a hundred cycles at 100 per cent applied strain. Organic 
thin-film field-effect transistors fabricated from these materials 
exhibited mobility as high as 1.3 square centimetres per volt per 
second and a high on/off current ratio exceeding a million. The 
field-effect mobility remained as high as 1.12 square centimetres 
per volt per second at 100 per cent strain along the direction 
perpendicular to the strain. The field-effect mobility of damaged 
devices can be almost fully recovered after a solvent and thermal 
healing treatment. Finally, we successfully fabricated a skin-inspired 
stretchable organic transistor operating under deformations that 
might be expected in a wearable device. 

Incorporation of dynamic non-covalent crosslinking between flexible 
polymer chains is an important method of achieving high stretchability 
and self-healing properties'*!°. The dynamic bonds can easily be broken 
to allow energy dissipation upon strain, thus making the system more 
tolerant of strain and mechanical stimuli. Moreover, the bonds can 
be reformed to recover the initial mechanical property and self-heal. 
Among the different types of dynamic bonds, hydrogen bonds are parti- 
cularly suitable for skin-inspired electronics owing to their spontaneous 
formation and healing ability!” °°. Such polar substituents in organic 
semiconductors have received increased attention in recent organic 
electronics research, but mostly in producing self-assembled nanostruc- 
tures instead of using them to improve mechanical properties”!. 


2,6-pyridine dicarboxamide (PDCA) was chosen to introduce 
hydrogen bonding within the flexible polymer backbone since this unit 
contains two amide groups possessing moderate hydrogen-bonding 
strength, allowing the formation of a polymer network without drasti- 
cally increasing the tensile modulus of the material”*”*. Previous work 
has indicated that introducing a small fraction of non-conjugated 
units to the polymer backbone does not noticeably degrade the charge 
transport mobility”. Here we introduce alkyl spacers to enhance the 
flexibility of the dynamic moieties. Semiconducting polymers incor- 
porating different ratios of non-conjugated PDCA moiety were thus 
synthesized (P1 to P4; structures shown in Fig. 1a). To confirm the 
presence of hydrogen bonds, both X-ray crystallography and nuclear 
magnetic resonance (NMR) experiments (Extended Data Figs 1 and 2, 
and Supplementary Figs 1, 2 and 11) were used to ascertain hydrogen- 
bond formation in PDCA-containing model compounds. 

Initial evaluation of the electrical properties was performed using 
organic thin-film field-effect transistor (OTFT) devices (Fig. 1b). 
For devices fabricated with P2 and P3, the measured mobilities were 
both >1cm? V~'s~'. Even with P4 (containing 20 mol% of non- 
conjugated monomers), its field-effect mobility remained as high as 
0.58cm? V~! s~! (Supplementary Fig. 12). To understand further the 
contributions from the disruption of conjugation and incorporation 
of hydrogen bonding on the mechanical properties of the polymers, 
we next measured the elastic modulus of the polymers (Fig. 1c and 
Supplementary Fig. 13). All the polymers with PDCA moieties exhibited 
a lower elastic modulus than that of the fully conjugated polymer (P1). 
Although intermolecular hydrogen bonding essentially cross-links the 
polymers and thus should increase the elastic modulus of the polymer 
film”, it appears that disruption of conjugation—that is, reducing the 
rigidity of the conjugated polymer backbone—has a larger effect on 
the elastic modulus of the polymer semiconductor film. We attributed 
the reduction in the measured elastic modulus to the changes in film 
morphology, such as the increase in the amorphous fraction in the 
polymer film or the decrease in relative crystallinity (Fig. le), and also 
a slight reduction of the average size of crystallites (Extended Data 
Fig. 3c), as characterized by grazing incidence X-ray diffraction analysis 
(Supplementary Fig. 14). 

To gain further insight into the molecular level changes during 
stretching, the degree of polymer chain alignment under strain was 
measured using polarized ultraviolet-visible spectroscopy and quan- 
tified using dichroic ratios (Fig. 1d and Supplementary Fig. 16). The 
dichroic ratio of P1 initially increased slightly with strain owing to 
strain-induced chain alignment, but soon decreased to approximately 1; 
this was caused by chain relaxation due to crack formation upon 
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Figure 1 | Design of stretchable and healable semiconducting 

polymer OTFTs and their performance characterizations. a, Chemical 
structures of polymers P1 to P7 and general mechanism for enhancement 
of stretchability in conjugated polymers via dynamic bonding. The 
stretchable semiconducting polymers are based on 3,6-di(thiophen-2-yl)- 
2,5-dihydropyrrolo[3,4-c]pyrrole-1,4-dione (DPP) repeating units and 
non-conjugated 2,6-pyridine dicarboxamide (PDCA) moieties introduced 
directly in the polymer backbone as hydrogen bonding units. To 
qualitatively evaluate the stretching properties of the polymers, thin films 
are supported on polydimethylsiloxane (PDMS) substrates and stretched 
to different strain ratios (0% to 100% strain; ‘Re’ refers to the value after 
strain release). b, Field-effect mobility supp and on/off current ratios of P1 
to P4 as measured by conventional OTFTs (the source and drain electrodes 
are 40-nm-thick Au, the dielectric layer is 300-nm-thick SiO, and the gate 


increasing the strain from 30% to 100%. This observation indicates that 
P1 is not very stretchable and, indeed, we observed that cracks began 
to form even at low strains (5%-10%); these were further confirmed 
by both optical microscopy and atomic force microscopy (AFM). On 
the other hand, the dichroic ratio of P3 increased linearly to 1.8, as 
strain increased from 30% to 100%. This observation is consistent with 
our microscopic observations that no microscale cracks were formed 
for P3, even with strain as high as 100%, showing that polymer chain 
alignment by strain can be achieved (Supplementary Figs 17 and 18 
and Extended Data Fig. 3). 

Relative degree of crystallinity analysis was used next to examine 
the change of morphology under strain (Fig. le and Supplementary 
Figs 19 and 20). The relative degree of crystallinity of P1 decreased 
with strain from 0% to 50%, and plateaued from 50% onwards. This 
observation, combined with AFM studies, indicates that the tensile 
strain applied to the thin film was initially released through breakage 
of the crystalline regions until crack formation. In contrast, the relative 
degree of crystallinity of P3 decreased steadily upon applied strain from 
0% to 100%. Upon releasing the strain, we observed that the relative 
degree of crystallinity recovered only partially, indicating that some of 
its crystalline domains may be permanently altered or broken apart 
by the applied strain. This result, combined with the steady increase 
in dichroic ratio previously observed, suggests that P3 has multiple 
mechanisms for energy dissipation during strain: (1) stretching and 
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electrode is highly doped silicon substrate). c, Influence of the hydrogen- 
bonding moiety amount on elastic modulus and crack formation strain. 
Error bars in b (three batches of samples were prepared, and five OTFTs 
from each sample were measured) and c (one data point was obtained 
from each sample, and three batches of samples were prepared and 
measured) are mean +s.d. d, Dichroic ratios of P1, P3 and P5 under 
various strain levels. aj and a are absorption intensities measured 

with the polarization direction of light parallel and perpendicular to the 
stretching direction, respectively. e, Influence of the strain ratio on the 
relative degree of crystallinity extracted from peak (200) for both ‘parallel’ 
and ‘perpendicular’ directions of P1, P3 and P5. Owing to the formation 
of large cracks, the relative degree of crystallinity of P1 perpendicular to 
the strain direction was not measured. The semiconductor is 35 nm thick. 
(a.u., arbitrary units.) 


realignment of polymer chains in amorphous regions; (2) breaking of 
some of the crystalline domains; and (3) breakage of hydrogen bonding. 

The mechanism of energy dissipation through hydrogen-bond 
breakages has previously been observed'®. We confirmed this benefit 
of incorporating hydrogen-bonding sites as strain-releasing moieties 
by comparison with several reference polymers. P5 contains 10mol% 
of non-conjugated alkyl spacers, that is, no PDCA moieties, on its 
polymer backbone (Fig. 1a). Despite having a lower elastic modulus 
than P3, P5 can achieve a maximum strain of only 30% before cracks 
form, a value much lower than that of P3’s maximum strain (110%). 
Furthermore, the measured maximum dichroic ratio for P5 was found 
to be only about 1.16 at 30% strain, consistent with the strain value at 
the onset of crack formation. In addition, the relative crystallinity of P5 
was measured to be lower than P1, indicating that an increase in the 
fraction of amorphous regions has occurred within the polymer thin 
film, similar to P2-P4. 

However, when 100% strain was applied to the film, the relative 
degree of crystallinity decreased drastically from 0.80 to 0.35, indica- 
ting that the tensile energy in P5 film is dissipated primarily through 
the breaking of crystallites (Supplementary Fig. 21). This observation is 
consistent with a recent report, which did not show much improvement 
on fracture strain with conjugation breakage of another DPP polymer”. 
P6 is another reference polymer, in which the pyridine moiety was 
replaced by a benzene ring while the amide moieties remain (Fig. 1a). 
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Figure 2 | Charge transport of semiconducting polymers under applied 
strain and their healing properties. a, Fabrication process for OTFTs with 
stretched polymer thin films. b, c, Field-effect mobility pp as a function 
of various strains along the strain direction (b) and perpendicular to 

strain direction (c). d, e, Field-effect mobility versus number of stretching 
cycles performed along the strain direction (d) and perpendicular to 


P6 was observed to possess enhanced stretchability compared to pol- 
ymers P1 and P5 (Fig. 1c and Supplementary Fig. 18f), which again 
suggests the importance of the intermolecular hydrogen bonding for 
enhanced mechanical properties. However, as compared to polymers 
P2, P3 and P4 (all containing pyridine moieties), its stretchability 
was reduced. This suggests that the introduction of pyridine moieties 
may contribute to the enhanced mechanical properties either through 
participating in intra- and intermolecular hydrogen-bonding or through 
changing the morphology of the film. Finally, reference polymer P7 
was synthesized to remove the hydrogen-bonding amide hydrogens 
(Fig. 1a). P7 showed a decreased stretchability with an onset strain for 
crack formation of about 25% (Fig. 1c and Supplementary Fig. 18g). The 
above control experiments with reference polymers further confirm 
the important contribution of the dynamic non-covalent hydrogen- 
bond-forming moieties as strain-releasing elements for enhancing the 
stretchability of conjugated polymers. 

To evaluate the charge transport behaviour of our fabricated stretchable 
semiconducting polymers, OTFTs were fabricated and characterized 
(Fig. 2a). On applying strains up to 100%, we observed that the average 
field-effect mobility of the P3 device decreased moderately, from 
1.32cm? V~'s~! to0.11cm? V~!s~|, along the direction of applied strain. 
Furthermore, the mobility was observed to recover to 1.00cm? V-!s~! 
upon releasing the applied strain. When the strain is applied in a per- 
pendent direction, the mobility of the P3 device is maintained, that 
is, >1cm? V_'s~|, up to 100% strain and even after release. In con- 
trast, the mobility of the P1 device degraded substantially even at 25% 
applied strain, and was totally non-functional at 100% strain. Unlike P3, 
the mobility of the P1 device is unable to recover upon stress release. 


LETTER 


Heating 


Solvent 
vapour 


Nanocracks 


had 


Step (1) 


Damaged 


i 
A Pristine a 
=z 10 Solvent 2 18 
ere vapour/heat| > 
eee %~ 1.0 
5 o 
pest -8 uw 
3 10 : Solvent = 05 
= q0 ; vapour = 
vd — 
A Damaged 2 00 
10-12 = 


-60 -40 -20 0 


Gate voltage, V, (V) 


20 


strain direction (e). f, Representation of the treatments used for healing 
the conjugated polymer films. g, h, AFM phase image for damaged (g) 
and healed (h) film of P3. i, j, Transfer curves (i) and field-effect mobility 
(j) of damaged and healed P3 OTFTs. Error bars in b, c, d, e and j are 
mean + s.d. (three batches of samples were prepared, and three OTFTs 
from each sample were measured). 


Besides a single stretching event, we also performed rigorous 
repeated stretching cycle tests on the P3 devices at various strains 
(Fig. 2d, e). After stretching the device for 100 cycles between 0% and 
25% strain, the mobility along the stretching direction was decreased 
by about 40% (that is, from 1.2 cm? V~! s~' to 0.74cm? V~! s~') and 
was further reduced to 0.017 cm” V_! s~! when subjected to 100% 
strain. However, when the strain was along the perpendicular direc- 
tion, the device showed higher durability and robustness. Even after 
100 cycles up to 100% strain, only a 26% decrease in mobility was 
observed. Only a 19% decrease in mobility was observed for 100 cycles 
up to 25% strain (Supplementary Fig. 22). 

In addition to intrinsic stretchability, the healing ability enabled 
by the dynamic bonding represents another major advantage of the 
polymer design. Although the self-healing abilities of non-conjugated 
polymer networks cross-linked with hydrogen-bonding sites have 
been reported”, self-healing ability has not been investigated for con- 
jugated polymers. To facilitate efficient healing for damaged P3 films, 
post treatments via heat and solvent annealing are required to promote 
polymer chain movement (Fig. 2f). When applied independently, both 
thermal and solvent annealing greatly reduced the size and density of 
the nanocracks. However, the field-effect mobility was observed to be 
only slightly recovered (Fig. 2g-j and Supplementary Fig. 23). 

Solvent annealing applied in conjunction with thermal annealing 
promoted the most efficient healing of the polymer films. Specifically, 
we observed a complete disappearance of the nanocracks within the 
damaged films, and more importantly, an almost complete recovery 
of the average field-effect mobility to 1.13 cm* V~! s~! (Fig. 2i, j and 
Supplementary Fig. 23). Additionally, the dependence of dichroic ratio 


17 NOVEMBER 2016 | VOL 539 | NATURE | 413 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


a Top contact CNT/PEDOT:PSS 
Bottom gate electrode 
(source and drain) 
DPP- / \ 
polymer 
(Semiconductor) 
CNT PDMS (dielectric) 
electrode 
(gate) 
Rubber 
substrate 
ce 10 d e 
< © 
o om x 
< = 0 
gs 5 ° 
3 = = 
= Oo ¥ 2 
£ i LS 
© -3 
4 Vg (0-60 V, 3 
step-10V) -0.96 44 —, 
0 -4li n 1 \ 
0.01 0.1 0.2 0.3 0.4 0.5 0.6 -60 -40 ~20 fe} 


Mobility, a¢¢ (em? V-" s~1) Drain voltage, V, (V) 


on strain completely recovered to a level similar to that of a pristine 
undamaged film, indicating the healed film has most probably returned 
to its original film morphology. In comparison, our various applied 
healing conditions did not improve the damaged P1 and P5 films 
(Supplementary Figs 24-28 and Extended Data Fig. 4). To the best 
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a, In situ stretching images of stretchable OTFT from 0% to 100% strain 
(scale bars in close-ups, 300 jm). b, ¢, Field-effect mobility jupp (b) and 
on/off current ratio and threshold voltage Vin (c) of a stretchable transistor 
during a stretching cycle. d, e, Mobility and on/off ratio of OTFTs for 500 
stretching cycles at 25% strain along channel length and width directions, 
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Figure 3 | Stretchable transistor 
characterization. a, Photograph and 
architecture of a fully stretchable 

5 x 5 organic transistor array 
fabricated with our developed 
stretchable conjugated polymers. 
CNT/PEDOT:PSS, carbon nanotube/ 
poly(3,4-ethylenedioxythiophene) 
polystyrene sulfonate. Inset, 

magnified source and drain electrodes. 
b, c, Mapping statistical distribution 

of the field-effect mobility pp in our 
transistor array with average mobility 
0.286 + 0.162 cm? V~! s~! (number of 
devices, n= 25 from a single substrate). 
d, e, Output (d) and transfer (e; for 
increasing and decreasing Vg as 
indicated by the arrows) curves of a 
fully stretchable 5 x 5 organic transistor 
array. Inset in d shows magnified 
output curves of the transistor at low 
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indicate the gate current, Ig. 
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of our knowledge, this is the first demonstration of complete healing 
(albeit non-autonomous) of a high field-effect mobility conjugated 
polymer after mechanical damages. 

Equipped with our newly developed high mobility and high stretcha- 
bility polymer semiconductor, we proceed to fabricate fully stretchable 
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respectively. f, Photographs of folded, twisted and stretched OTFTs on 
human skin. g, OTFT mobilities after various extreme human motions. 

h, Mobilities of mechanically damaged OTFTs before and after healing 
process. Error bars in g and h are mean + s.d. (five batches of samples were 
prepared, and one OTFT from each sample was measured). 
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OTFTs (5 x 5 arrays; Fig. 3a, Supplementary Fig. 29 and Extended Data 
Fig. 5). Our obtained device yield is 100%, along with good switching 
performance (Fig. 3b-e). Furthermore, most of the devices exhibited 
field-effect mobilities in the range of about 10~'cm* V~! s~! with >10° 
on/off current ratio (Supplementary Fig. 29). In specific, the highest 
value obtained for the mobility is 0.6cm* V—! s~', while the average 
mobility was 0.28 cm? V~! s~'. In general, we observed that the fully 
stretchable OTFTs all exhibited lower mobilities compared to OTFTs 
as fabricated on rigid Si/SiO2 substrates treated with self-assembled 
monolayer of n-octadecyltrimethoxysilan (OTS). We attributed this 
observation to the lower applied electric field with a thicker layer of 
dielectric on the stretchable OTFTs devices (Supplementary Fig. 30). 
The stretchable devices showed little hysteresis and low gate-leakage 
currents. These measured performance parameters represent the best 
of the current reported stretchable organic semiconductors and fully 
stretchable OTFTs (Supplementary Tables 3 and 4)?7-°°. 

To verify the stretchability of our organic transistors, we investigated 
their electrical performance by subjecting them to various strenuous 
mechanical strain conditions. The transistors showed a slow linear 
decrease in mobility when being strained up to 100%. After releasing 
the strain, the mobility was observed to recover to close to its initial 
values (Fig. 4b and Supplementary Fig. 31). The same trend was also 
observed for the on/off current ratios. These trends are qualitatively 
similar to the trend observed when only the polymer semiconductor 
is stretched. Most practical applications, however, require effective 
operation only for applied strains of 20% to 30%, so we performed 
stretching durability tests at 25% strain and 500 cycles for our fabricated 
stretchable transistors (Fig. 4d, e). We observed that both the field- 
effect mobility and on/off current ratio of our transistors did not suffer 
any noticeable decrease in performance. Furthermore, we mounted our 
fabricated devices on human limbs to test device tolerance to various 
common movements, such as folding of the arm, twisting of the hand, 
and stretching of the elbow. Our device is able to maintain its average 
mobility at >0.1 cm? V~' s~! (Supplementary Fig. 32) under all these 
conditions. Finally, the healing ability of the polymer semiconductor 
can again be observed using the heating and solvent annealing process 
we developed (Fig. 4h and Supplementary Fig. 33). 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


All details of materials synthesis and structural characterizations can be found in 
the Supplementary Information. 

Materials synthesis and characterization. Synthesis of DPP-based conjugated 
polymers containing non-conjugated PDCA units in the backbone was carried 
out via Stille polymerization using different ratios of PDCA-containing mono- 
mers (see Supplementary Information). Branched alkyl chains were installed 
on the DPP monomer in order to increase solubility and 7-1 stacking. We also 
selected (E)-2-(2-(thiophenyl-2-yl)vinyl)thiophene (TVT) as co-monomer. The 
non-conjugated monomer was reacted with the brominated DPP monomer and 
the distannylated TVT under Stille polymerization conditions to afford P1 to P4. 
Polymers were purified by precipitation in methanol followed by Soxhlet extraction 
using methanol, acetone, hexanes and chloroform. All polymers showed good 
solubility in different solvents such as chloroform and chlorobenzene. Increasing 
the non-conjugated moiety content within the backbone also improved the solu- 
bility in polar solvents such as THF and DME. The synthetic procedures for pre- 
cursors, monomers and polymers are detailed in the Supplementary Information. 
Molecular weight, polydispersity index, monomers and polymer characterization 
are all detailed in the Supplementary Information. All results are from polymers 
with number average molecular weight M, over 14kDa and with a polydispersity 
below 4. Polymers with low M, values showed less improvement in stretchability. 
Thin-film characterization. To be consistent with films used for transistor fabrica- 
tion and characterization, all films tested have thicknesses in the range 30-35 nm. 
They were spin-coated on OTS-treated SiO2/Si substrate from chlorobenzene 
solution (5mg ml~') at 2,000 1r.p.m. in glove box and thermally annealed at 150°C 
for 10 min. They were subsequently transferred to PDMS elastic substrates. The 
thin-film annealing temperature was found to affect stretchability. Significantly less 
improvement was observed for samples annealed at 150°C for 30 min. Therefore, 
annealing for 10 min was used for all films. The morphology of polymer films was 
measured by optical microscopy and AFM. The molecular stretchability of poly- 
mer film was investigated by polarized ultraviolet-visible spectroscopy. The elastic 


modulus of thin-films was measured using the buckling method (Supplementary 
Fig. 24). Grazing incidence X-ray diffraction was conducted at beamline 11-3 of 
the Stanford Synchrotron Radiation Lightsource. The incidence angle was 0.12 and 
the X-ray wavelength was 0.9758 A, corresponding to a beam energy of 12.7 keV. 
All grazing incidence X-ray diffraction images were collected in reflection mode 
with a two-dimensional area detector and the sample under a helium atmos- 
phere. Stretching and relaxing rates of the polymer thin films were performed at 
0.65mm s_! (strain rate 2.6% s~'). 

Conventional organic transistors. The semiconducting polymer layer (35 nm) 
was spin-coated on OTS-treated SiO,/Si substrate in glove box and was thermally 
annealed at 150°C for 10 min. The source and drain electrodes (Au, 40 nm) were 
thermally evaporated under 5.0 x 10° Torr. 

Skin-inspired organic transistor. Carbon nanotube (100 nm) film as the gate 
electronic was transferred onto polystyrene-block-poly(ethylene-ran-butylene)- 
block-polystyrene (SEBS) elastomer. The PDMS layer (1.8 1m) was then transferred 
onto a carbon nanotube/SEBS substrate. Then the semiconducting polymer layer 
(35nm) was transferred on the PDMS dielectric layer. Finally, the PEDOT:PSS 
(30nm)/carbon nanotube (70 nm) bilayer as the source and drain electrode was 
spray-coated onto the semiconducting layer. The transferring method for the 
device fabrication is depicted in Extended Data Fig. 5. 

Healing process. For the solvent vapour treatment, the damaged sample is exposed 
to CHCl, vapour in a closed jar for 10 min at 40°C. For the heat treatment, the sample 
is thermally annealed on a 150°C hotplate for 30 min. For the combined solvent 
vapour and heat treatment, the sample first undergoes CHC]; vapour annealing 
(10 min at 40°C) and then thermal annealing (30 min on hotplate at 150°C). 
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Extended Data Figure 1 | General characterization of P1-P4.a, Polymer _ cyclic voltammetry. Potentials versus Ag/AgCl using 0.1 M TBAPF, 


composition by 'H NMR. The ratio of PDCA moieties incorporated in (tetrabutylammonium hexafluorophosphate) in CH3CN (acetonitrile) 
the polymer backbone is determined by the integration of protons (1) as the electrolyte solution. ‘Calculated using the following equation: 
versus the alkyl-chain terminal protons (2). b, General characterization gap = 1,240/ onset of polymer film. (PDI, polydispersity; Ty, degradation 
of P1-P4. *Determined from thermogravimetric analysis. The HOMO temperature; My, weight average molecular weight; \, absorption 
(highest occupied molecular orbital) energy level was calculated from wavelength in ultraviolet/visible spectroscopy). 
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Extended Data Figure 2 | Intermolecular interactions between PDCA 
moieties. a, Chemical structure of model compound 1 and 'H NMR 

at various concentrations of compound 1 in CDC];. Upon increasing 
concentration from 0.05 M to 0.8 M, a distinct shift of the amide proton 
(black arrow) towards low fields is observed. This indicates hydrogen 
bonding between the PDCA moieties, as previously observed"). 

A dimerization constant of 0.18 M_! was determined by plotting 
concentration versus chemical shift and fitting using a dimer association 
model™. b, Molecular structure of 1 showing intermolecular hydrogen 
bonds determined by single-crystal X-ray diffraction. Ellipsoids are set 
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at the 30% probability level. Selected hydrogen atoms are omitted for 
clarity. c, Chemical structure of model oligomer M1 and 'H NMR of 

M1 at various temperatures in 1,1,2,2-tetrachloroethane-d, showing the 
amide NH protons peak. The chemical shift upon temperature increase 
indicates a breaking of the hydrogen bonds formed between the polymer 
chains (shown as insets). Oligomer M1 was used for this study because 
the solubility of the polymer was not sufficiently high to perform a similar 
study under high concentrations. (a.u., arbitrary units; ppm, parts per 
million.) 
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Extended Data Figure 3 | Behaviour of polymer thin-film under strain. 
a, Optical microscope images of P1, P3 and P5 as function of applied 
strain (0%-100%). b, Height and phase AFM images of P3 under 100% 
strain showing no crack formation. c, Table of grazing incidence X-ray 


diffraction data for P1 and P3 films as a function of strain (0%-100%). 
The samples are annealed at 150°C for 10 min. A reduction in the mean 
size of crystallites is observed for P1 to P3. The semiconductor is 35 nm 
thick. (FWHM, full-width at half-maximum.) 
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Extended Data Figure 4 | Thin-film characterization of damaged and 
healed P3 and P5. a, b, Atomic force microscopy images of damaged (a) 
and healed (b) thin film of P3 after solvent and thermal annealing. We 
note that all the previously observed nanocracks were absent after the 
healing process. ¢, Dichroic ratio of P3 healed thin film as determined by 
polarized ultraviolet-visible spectroscopy. d, e, Atomic force microscopy 
images of damaged (d) and healed (e) thin film of P5 after solvent and 
thermal annealing. We observed that a small number of nanocracks 
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remained in the film. f, Dichroic ratio of P5 healed thin film as determined 
by polarized ultraviolet-visible spectroscopy. We observed that the 
dichroic ratio of healed film of P3 fully recovered to a value similar to 

that of the pristine film without damage. On the other hand, when P5 

is subjected to the same treatment, the dichroic ratio was not restored, 
indicating that the movement of the polymer chains was insufficient to 
restore the film’s mechanical properties. 
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Extended Data Figure 5 | Fabrication and electronic properties of a 
fully stretchable 5 x 5 transistor array. a, Fabrication process of fully 
stretchable OTFTs. (1) Transfer printing of the carbon nanotube gate 
electrode as prepared by spray-coating a carbon nanotube solution 

(10 mg ml“ in CHCls) on SEBS substrate (thickness 200 jum). (2) Contact 
transfer printing of PDMS dielectric layer as prepared by spin-coating 

a diluted PDMS (220 mg ml! in CHCl;) on OTS-treated SiO, substrate 
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on a carbon nanotube gate electrode. (3) Contact transfer printing of 
semiconducting polymer layer (prepared by spin coating onto OTS-treated 
SiO. substrate) on PDMS dielectric layer. (4) Spray coating of carbon 
nanotube (70 nm)/PEDOT:PSS (30 nm) source and drain electrodes on 
the semiconducting layer. b, c, On/off current mapping (b) and statistical 
distribution (c) of 5 x 5 fully stretchable OTFT arrays (width 1,000 zm; 
length 150,1m). Number of devices from a single substrate, n = 25. 
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Abstract— This paper introduces a novel approach to correct 
stripe non-uniformity in infrared images. The differences 
between the bias of the column readout circuit of infrared focal 
plane arrays result in strong stripe noise. We propose that the 
bias of the column readout circuits can be observed by using a 
1D bilateral filter on the average signal of the columns of 
infrared images. In this paper we regard the bias of column 
readout circuit as Gauss-Markovy process. The proposed Gauss- 
Markov model provides a mechanism to capture the slow drift of 
the bias. A Kalman filter is derived that uses the observed bias to 
update the bias estimates. Finally, the infrared images are 
corrected using the estimated biases. The effectiveness of the 
method is shown according to the experimental results on real 
infrared images. 


Keywords— Infrared image; stripe non-uniformity; Fixed 
pattern noise; Infrared focal plane array; Kalman filter 


I. INTRODUCTION 


Infrared imaging is very useful in industry, military and 
medical application since it can acquire infrared information. 
Infrared focal plane arrays (IRFPA) are employed 
predominantly in infrared imaging systems due to its 
numerous advantages such as compactness, production cost 
effectiveness, and high sensitivity[1]. An IRPFA is composed 
of small detectors and readout circuits. Because of 
mismatching of components and variations of parameters in 
the fabrication process, the spatial responses of pixel detectors 
and the bias voltages of readout circuits are different[2, 3]. 
Non-uniformity brings about the notorious non-uniformity 
noise, termed fixed pattern noise, which degrades quality of 
the acquired infrared images strongly. Non-uniformity 
correction (NUC)is a necessary step to improve infrared 
images[4]. Moreover, stripe non-uniformity which is due to 
non-uniformity of readout circuits is a special kind of non- 
uniformity, which should be considered as a special case[5, 6]. 
Since detector parameters drift slowly in time, one-time 
calibration of infrared focal plane array (IRFPA) cannot 
provide an effective solution to the NU problems. NUC must 
be performed repeatedly as drift occurs. 

Non-uniformity correction techniques are categorized 
mainly into two classes: calibration-based and scene-based 
techniques. Calibration-based methods[7, 8] requires a 
uniform reference scene (typically, a blackbody radiation 
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source) in order to calibrate the response of each detector 
across the array so that all of detectors produce accurate and 
uniform readout values at the two different reference 
temperatures. This kind of NUC techniques often achieves 
radiometrically accurate readout values. However, since they 
need to interrupt the operation of the camera during the period 
when calibration is conducted, they are not convenient for real 
time applications. The calibration is applied in a long interval 
of time during which the parameters is considered as constant. 
Scene-based techniques rely on signal processing to remove 
the NU noise. They typically use an image sequence captured 
during the normal operation of the camera, and _ their 
performance relies on global motion in the sequence to 
provide diversity in the scene temperature per detector. 

To date, numerous scene-based NUC techniques have been 
reported in the literatures. Harris and Chiang[9] developed 
algorithms that associate the mean and standard deviation of 
each detector with the gain and bias parameters by using the 
constant statistics assumption. This assumption postulates that 
the input irradiance is a uniformly distributed random variable 
and all pixel detectors should have the same mean and 
standard deviation. Under this assumption, the bias and gain 
of each detector can be estimated by employing linear mode. 
But this assumption is reasonable only if image sequence is 
long and so enough motion that each detector sweeps many 
different parts of the scene. Hayat et al.[10] proposed a finite 
impulse response (FIR) least-mean-square (LMS) filter that is 
used to estimate the gain and bias of each detector. The 
algorithm updates the gain and bias by use of subsequent 
images online and uses the updated parameters to correct 
infrared images. Torres et al.[11] developed a statistical 
technique called Constant Range that relies on the key 
assumption that all detectors are exposed to approximately the 
same range of irradiance. Ratliff et al. [12]developed an 
algebraic scene-based NUC technique that does not rely on 
any statistical or scene-diversity assumptions about the scene 
temperature. This algorithm employs estimates of interframe 
sub-pixel shifts in an image sequence and linear interpolation 
model for the motion to extract information on the bias non- 
uniformity. Torres and Hayat [13]model the gain and the bias 
of each detector as a Gauss-Markov random variable, and use 
a Kalman filter to update these estimates of the gain and bias 
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of each detector. Pezoa et al.[14] utilizes a ban of Kalman 
filters in parallel to estimate the gain and bias of each detector 
according to its own dynamic-model parameters. The final 
estimates of the gain and bias is achieved by forming a 
weighted superposition of all the estimates rendered by each 
Kalman filter. 

Stripe noise is one of main fixed pattern noise caused by 
variation of the biases of column integration readout circuits 
in infrared imaging system[15, 16]. Since the detectors in the 
same column of IRFPA share the one column readout circuit 
and the detectors in the different column of IRFPA use 
different column readout circuit, the difference between the 
biases of the column readout circuits produces the different 
readout values at different columns when the detectors is 
exposed to the same radiation. The usual NUC techniques are 
not applicable to eliminate stripe noise. Stripe noise varies 
frame by frame, but calibration-based NUC methods cannot 
update parameters in time, which regard the parameters as 
invariable in a long time between two successive calibrations. 
The scene-based NUC techniques can resolve the problem that 


parameters drift slowly. However two problems may arise[17]: 


1) a long image sequence is required for convergence. 2) a 
long image sequence may results in a “ghost artifacts” due to 
the previous images appears in the last one. In this paper, we 
propose a novel correction approach to eliminate stripe non- 
uniformity only using scene images. Assuming that the 
neighboring pixels in images are very similar, the difference 
between the pixels of neighboring column is caused mainly by 
biases of column readout circuit. Under this assumption, we 
proposed an approach to estimate the biases of readout circuit 
by using Kalman filter, and stripe non-uniformity is corrected 
using the estimated biases. 

The paper is organized as follows. Section 2 presents the 
model for stripe non-uniformity of infrared images. The 
recursive estimate of the bias of column readout circuits is 
described in section 3. Experiments on real infrared image are 
described in section 4. Section 5 contains a conclusion. 


Il. MODEL FOR NU STRIPE NON-UNIFORMITY 

Stripe non-uniformity in infrared images is due to 
difference between column integration readout circuits. We 
can make the following assumption: 

1) The biases of column integration readout circuits can be 
regarded as unknown random variables. The pixels in the 
same column of IRFPA share the same bias. 

2) The biases of different columns of IRFPA are 
independent of each other. 


As a consequence of assumptions, the output of the ijth 
detector of the Ath frame is represented by 

YG )=U,G +X, DAVG Ds 

i=1,---M,j=l--,N 


where X,(/) is the bias of the jth column readout circuit 


(1) 


in IRFPA. U, (i, j) is the infrared radiations collected by the 
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ijth detector in the kth frame. V (i,j) is the additive 

electronic noise. The sum of the value of infrared image 

Y,(i, /) along with column can be written as 
Y(DN=U,)+4,Q) th.) (2) 


Assuming that the infrared radiations of neighbouring 
pixels in space are similar, the differences between columns of 


the image Y,(i, 7) are mainly caused by the bias X,(/). 
According to above assumption, U,(j) should be a slow 
changing signal, and X,(/) is considered as the additive 


temporal noise associated with the readout circuit of jth in 
infrared image. 

We can separate the column signal Y, (j) into smooth part 
Y2(J) and detail part Y,?(j) by applying a 1D low pass 
filter. The filtered output Y,’(j) is computed as a weighted 
average of the signal Y,(j). The detail signal Y,?(j/) is 
obtained by subtracted ye (j) from Y,(/) as following 

Dp a Bre 
Y Q=Y,Q-Y% VY) (3) 

The linear low-pass filter assigns weight coefficient only 
depending on spatial distance, which suffers with serious halo 
artifacts when there is a rapid variation of the signal. To solve 
this problem, our method adopts a nonlinear bilateral filter. 


The bilateral filter[18] are descripted by the following 
equation, 


a) = FDS e-xhl x—x')A, (ll F@)— FED) 
(4) 


where k(x) is normalized factor, h,(x) is spatial kernel 


x 


function and h,.(x) is range kernel function. The key idea of 


bilateral filter is to avoid excessive smoothing of signal with 
rapid variation by using range kernel function, since 


h,(x) will assign a smaller weight coefficient with larger 
rang difference between f(x) and f(x') . The detail part 
Y,?(J) of the column signal Y, (j) can be regard as the 


observation values of the bias of column readout circuits. 
III. RECURSIVE ESTIMATION OF THE BIAS 


The observed bias ves (j) of the column readout circuits 
includes noise and observation error, so it can be written as 

Ye (=X D+ NG) (5) 

where N,(j) is an additive temporal noise with Gauss 


F 2, 3 
and variance Oj, Observation vector 


kth 


distribution 


corresponding _ to frame is expressed as 


ie [YP MVP). ¥P(N) | , which is an array of 


length N of column signal. Ultimately, we hope to achieve 
the minimum mean square error (MMSE) estimation of the 


state vector (bias vector) X, from the observation sequences 


k 
{y,?} in a recursive way. 
i=0 
Assuming that the variation of the bias of column readout 
circuits is a Gauss-Markov process, the bias X,,, in 
(k+1)th frame can be represented as autoregressive model, 
Xp = AX, +, (6) 
where A, = diag (a,(1),a,(2),:+,a,(N)) is called the 
state transition matrix and 0 <a, (i) <1,i=1,---,N. W, is 
gauss noise vector with identical and independent distribution, 


its covariance is diag (O°.,*++,0-) . The drift of the bias 


cannot lead to the long-term change of the dynamic range of 
the bias of readout circuits, so the average drift should be kept 
as zero. Assuming that the parameters of state transition 


matrix is same, Le. a,(1)=-:-=a,(N)=a , the mean of 
noise W, should be 
M, =E|W,|=X, (1-4) 


where X, represents initial state. 


(7) 


Given the observation vector Y, , the MMSE estimator of 


state vector X, is represented recursively as 


X,=%,+K,[%,-% | (8) 


where is called the defined as 


xy 
Xe 2 E[X, | Y, oo, Y | , and can be solved iteratively using 

X, =A Xa tM (9) 
where M,_, is given at Eq.(7). The matrix K, is called 


priori estimate, 


Kalman gain matrix, and can be computed by using 


K,=P,[P-+®] (10) 


where @® is the covariance matrix of observation noise. Po 


is called a prior error covariance and defined as 
PSE [x eo 3 Fe y| , which can be computed iteratively 


by using 
Po = ALP Aa tus (11) 


The error covariance matrix P. = E es aw? ye can be 


k = 
updated by using 


f= (1-H Fe (12) 
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To execute the above iterations, the initial conditions for 
state vector and error covariance matrix is given as following, 


X,=¥ (13) 
P, = diag(Oy,-*, On) (14) 
Finally, the observed infrared image Y, (i, j) is corrected 
using the estimate bias xX . 
RG D=YGD- XW) 


IV. EXPERIMENTAL RESULTS 


(15) 


In this section, we discuss the experimental results obtained 
with the proposed algorithm. The data used in our experiment 
is realistic infrared images from a public infrared image 
dataset[19]. The experiments are carried out in MATLAB 
R2009a on a personal computer equipped with Windows (2.8 
GHz Intel Core i5 CPU, 4 GB memory). The proposed 
algorithm is compared with the state of the art stripe noise 
removal method based on MHE[17, 20]. 

As shown in Fig.l, stripe noise in infrared images is 
reduced obviously using both MHE and our method. Although 
the MHE method reduce stripe noise, residual stripe noise is 
still obvious in images. Our noise removal results are better 
visually than MHE. 


V. CONCLUSIONS 


In this paper, we have presented a new technique to correct 
the stripe non-uniformity in infrared images. We take into 
account the fact that the stripe noise is produced by the non- 
uniform biases of column readout circuits in IRFPA, a novel 
approach is introduced to recursively estimate and correct bias 
non-uniformity. The proposed techniques works on average 
signal of the columns of infrared image. The observation of 
the bias of each readout circuit is achieved by using a 1D 
bilateral filter. Through modelling the bias by Gauss-Markov 
process, the proposed method captures the drift of the bias in 
each column. To achieve the recursive estimation, we derived 
a Kalman filter to estimate the bias of each column. Finally, 
we correct each detectors of infrared image using the 
estimated bias. The proposed algorithm is compared to the 
state of the art strip non-uniformity correction algorithm based 
on MHE using realistic infrared image. The experimental 
results show that our proposed algorithm obtains better effect 
than MHE. 
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Implications for metal and volatile cycles from the 
pH of subduction zone fluids 


Matthieu E. Galvez!, James A. D. Connolly! & Craig E. Manning? 


The chemistry of aqueous fluids controls the transport and 
exchange—the cycles—of metals'~* and volatile elements*®” on 
Earth. Subduction zones, where oceanic plates sink into the Earth’s 
interior, are the most important geodynamic setting for this 
fluid-mediated chemical exchange”®!°. Characterizing the ionic 
speciation and pH of fluids equilibrated with rocks at subduction 
zone conditions has long been a major challenge in Earth science’. 
Here we report thermodynamic predictions of fluid-rock equilibria 
that tie together models of the thermal structure, mineralogy and 
fluid speciation of subduction zones. We find that the pH of fluids in 
subducted crustal lithologies is confined to a mildly alkaline range, 
modulated by rock volatile and chlorine contents. Cold subduction 
typical of the Phanerozoic eon” favours the preservation of oxidized 
carbon in subducting slabs. In contrast, the pH of mantle wedge 
fluids is very sensitive to minor variations in rock composition. 
These variations may be caused by intramantle differentiation, or by 
infiltration of fluids enriched in alkali components extracted from 
the subducted crust. The sensitivity of pH to soluble elements in 
low abundance in the host rocks, such as carbon, alkali metals and 
halogens, illustrates a feedback between the chemistry of the Earth’s 
atmosphere-ocean system!*)> and the speciation of subduction 
zone fluids via the composition of the seawater-altered oceanic 
lithosphere. Our findings provide a perspective on the controlling 
reactions that have coupled metal and volatile cycles in subduction 
zones for more than 3 billion years’. 

Water-based chemical reactions are essential to Earth’s surface 
chemistry. Water is also crucial to our planet's inner chemical dynamics. 
Subduction zones are the primary locations where surface water is 
delivered to the deep Earth, promoting low-temperature (T) melting 
of the crust’® and mantle’, affecting the rheology of the lithosphere’, 
blurring pre-metamorphic signatures of ancient Earth’s palaeo- 
environmental conditions'®'* and contributing to the global recycling 
of volatiles® * and metals”**, Consequently, characterizing the reactiv- 
ity of supercritical aqueous solutions in subduction zones is a problem 
of both scientific and societal relevance in Earth science. The key vari- 
ables that control rock mineralogy and fluid chemistry are pressure (P), 
T and composition, which in combination govern, for example, fluid 
redox state®, speciation? and electrical properties''. Although the redox 
budget of subduction zones has been the subject of growing attention’®, 
obtaining quantitative constraints on their acid—base budget and on 
the ionic (for example, the pH; refs 5, 12) and molecular composition 
of fluids at mantle conditions has proven difficult in both theory and 
experiment!)!”, Recent molecular dynamic simulations have extended 
the range of conditions at which the solvent property of water (its 
dielectric constant ¢,, Extended Data Fig. 1) is known to upper-mantle 
conditions'*”°, and these results have been generalized for geologi- 
cally ubiquitous carbonic fluids*!””. However, geochemical applications 
remain hindered by insufficient knowledge about the sensitivity of fluid 
properties to changing mineralogical assemblages at the elevated P and 
T prevalent in the Earth’s interior. 


We address this problem using a thermodynamic approach” 
that connects fluid composition and speciation to predicted rock 
mineralogy as a function of P and T. After determining the P-T 
stability fields of solid-solvent phase assemblages for crustal and mantle 
lithologies (Fig. 1, Methods and Extended Data Table 1), the chemical 
potentials from the mineralogical model are used to derive fluid 
speciation (Figs 2 and 3) using oxide hydrolysis equilibria represented, 
for example, by the alkali - proton exchange reaction: 


Na,O + 2H* =2Nat+H,0 (1) 


We assume fluid-rock equilibrium holds at all conditions, as supported 
by rapid rates of mineral dissolution and precipitation in hydrothermal 
fluids’; our results hold for common scenarios of pervasive fluid flow 
at low transport rates where grain boundary diffusion dominates. 
Redox reactions of Fe have been discounted, that is, the redox com- 
position of the fluid (O/H ratio) is determined by the devolatilization 
process. At equilibrium, under the assumption of rock-dominated 
chemistry, the chemical potential (j) of oxides constrains fluid 
speciation by: 


Lynavo” HHo = 2 (Lygt — Ht) 
Ce aire) na 


solute speciation 


(2) 

mineralogical model 
The partial molar Gibbs energy of the ith solute is y;=g> + RT 
In(y,m;), where g, is the standard-state Gibbs energy of 1 mol of solute 
per kg of solvent at P and T. Here 7; is the solute activity coefficient, m; 
the molal concentration, their product 7m; is the activity (a) of i, and 
pH= —logjo(ay1). Thermochemical data for minerals, solvents 
and solutes are reported in Methods and Extended Data Table 2. This 
approach provides fluid speciation and pH by charge balance 
(Methods), and is original in that it tracks the release of all soluble 
elements simultaneously. We focus on pH, and particular emphasis is 
placed on testing the sensitivity of predictions to change in P-T condi- 
tions and rock composition (Figs 2 and 3). 

The fluid alkalinity is defined by ApH = pH — pH,, which tracks 
the acidic (ApH <0) or alkaline (ApH > 0) character of the fluid 
with respect to acid-base neutrality. Neutral pH (pHn=0.5[pKy — 
logio(4y,0)]) is defined as the condition where ay+=aoy-, and, 
accordingly, model ApH takes into account the modification of water 
activity and ionization constant (K,) in the mixed solvents. Crustal 
pelite and basalt exhibit similar isothermal trends of ApH when H20 
is the solvent. This pattern reflects uniform variations in partial molar 
volume of Na,O in these systems where sodic phases include feldspar 
(albite, Ab), mica (paragonite, Par), clinopyroxene (jadeite, Jd) and 
amphibole (Fig. 1a, b). The ApH reaches a maximum at conditions 
corresponding to the Ab = Jd + Quartz (Q) reaction, and is in the range 
2.30.2 along representative cold and intermediate P-T paths 
(Figs 1, 2d). The ApH of fluids equilibrated with an alkali-free mantle 
peridotite spans a larger range. It peaks near 2.1 + 0.2, reflecting incom- 
patible behaviour of calcium in the ultramafic system, at 400-500°C, 
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Figure 1 | ApH of solutions in equilibrium with basalt, pelite and 
peridotite lithologies. Here ApH = pH — pH, describes the deviation of 
fluid pH from pH at acid-base neutrality, pH, (see text and Methods). It 
is plotted against the stable mineral assemblages (that label each field) as 
a function of P and T for basalt (a), pelite (b) and peridotite (alkali-free 
lherzolite) (c) compositions (Extended Data Table 1) at equilibrium with 
H,0. Mineral abbreviations (defined in Methods) in bold denote phases 
present in the entire field, those in plain text appear in some parts of 

the field. Indicative amounts of H2O structurally bound in minerals are 
given in parentheses in wt% below the characteristic phase assemblage. 
Decreasing amount of H,O in the mineral assemblage with heating 
indicates progressive rock dehydration. The carbonated basalt (Fig. 2) 


and declines to about 1.0 with heating and transition to chlorite- and 
garnet-peridotite along cold and intermediate P-T paths (Figs 1c, 2d). 
These trends are maintained along the path representative of hot 
Precambrian subduction zones’? (Figs 1a, 2a). 

Carbonic solvents formed if graphite or carbonate are present in the 
pelite (redox neutral solvent with atomic ratio O/H = 1/2) and basalt 
(oxidized, O/H > 1/2), respectively, are characterized by lower H,O 
activity (0.6 <ay,9< 1). The ApH of the C-bearing solutions decreases 
by up to 0.7 units for the carbonated-basalt fluids (Fig. 2b, e). Their 
carbon abundance (Fig. 3a) is about one order of magnitude greater 
than in fluids equilibrated with graphitic pelite!®. Although metal 
solubilities are semiquantitative at these conditions”!”4, the (Na -+K)/Al 


differs from the C-free system by the presence of calcite/aragonite (up to 
~1.3 GPa), dolomite (~1-1.9 GPa) and magnesite above ~1.9 GPa. Two 
Phanerozoic P-T paths (white dashed lines), that is, Honshu (‘cold’ path, 
labelled 1) and Columbia (‘intermediate’ path, 2) are indicated in a along 
with a ‘hot’ P-T path (path 3) (Methods). Dash-dotted curves in grey are 
approximate experimental locations of the wet basalt/pelite solidii in a 

and b, and also show the carbonated-basalt solidus in a (Methods). All 
fields for basalt and pelite, with the exception of those marked with an 
asterisk where quartz is not stable at P< 0.7 GPa and T > 600°C, contain 
quartz or coesite (Coe). Thick lines designate boundaries between classical 
metamorphic facies referred to in Fig. 2. Precision on absolute ApH values 
is +0.2 ApH. Accuracy is within 1 ApH unit (Methods). 


ratios of fluids increases, by a factor of 2 to 20 for pelite and basalt, 
respectively, with minor bulk-rock solubility increase. These modifi- 
cations are due to competing solvent (H2O dilution with non-dipolar 
molecules such as CO, and CHy,), pH (carbonate equilibria and 
equation (2)), and speciation effects (alkali-carbonate ion pairing). It 
illustrates that the ability of subduction zone fluids to dissolve metals, 
irrespective of their accessible oxidation state, depends on the fluid’s 
volatile speciation, and thus redox state. Similarly, the solubility of C 
and the C/Ca ratio in the basaltic fluid increase with heating and 
decompression (Fig. 3a, b). These trends are supported by experimental 
evidence”, and controlled by incongruent dissolution of Ca-Mg-Fe 
carbonate. Along hot to intermediate geotherms, crustal and oceanic 
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Figure 2 | ApH along three representative Precambrian and 
Phanerozoic P-T paths. The three paths are shown in Fig. 1a. a-f, ApH 
of a pure H,O solvent (a, d) at equilibrium with peridotite (grey), basalt 
(red) and pelite (yellow) along hot (a-c), intermediate and cold P-T 
paths (d-f) (see Methods for details). b, e, Results obtained for a 
carbonic solvent, that is, with C added to the pelite (redox-neutral 
solvent , ac = 1, O/H =0.5 (atomic ratio) and aco, © acu,) and basalt 
(H20-CO; solvent, O/H > 0.5). Greater C solubility and CO> ionization 
(carbon species include, CAHCO} CaCO$, HCO;, NaCO;, CO}~ 


mantle dehydration occur at shallow depths (<3 GPa 80km, Fig. 1) 
and are concomitant with elevated C solubility (4-10 mol C per kg H20, 
Fig. 3b), favouring ample fore-arc carbonate dissolution and melting 
(Methods, Extended Data Table 3). This regime characterizes hot 
Precambrian subduction zones and rare modern equivalents. The cold 
path differs fundamentally in that slab dehydration is limited, and 
confined to elevated P (Fig. 1a) and low T where the solubility of 
carbon is relatively low (0.3-3 mol per kg H2O; Fig. 3a). Thus, cold 
paths typical of the Phanerozoic favour the mechanical incorporation 
of carbonates in the deep upper mantle (Methods, Extended Data 
Table 3), in agreement, for example, with the ubiquitous persistence 
of Ca-Mg-Fe carbonates in ultra-high-P terranes on Earth”®. This tem- 
poral directionality implies that the return of surface O to the mantle 
by slab carbonates is enhanced by subduction cooling over geological 
time (Extended Data Table 3). The formation of extensive domains of 
mantle redox heterogeneities, and their associated processes!°7, may 
bea geologically recent phenomenon. 

Addition of chlorine in amounts typical of dilute subduction zone 
fluids”® shifts predicted ApH closer to neutrality regardless of thermal 
structure (Fig. 2c, f). Thus, at T> 500°C, the ApH of the three 
studied lithologies is confined to a narrow mildly alkaline range (Fig. 2). 
Larger contrasts develop at lower temperatures, or at elevated T and 
P lower than 1 GPa (Fig. 1) characterized by enhanced ion association 
and complex neutralization, and, thus, pH is a less pertinent system 
variable. The ApH is particularly sensitive to seawater components 
in the altered oceanic lithosphere such as carbon, alkali and halogen 
(Fig. 2). Slab thermal structure plays a subordinate role in determining 
pH, but exerts an important control on the geochemical systematics of 
the volatile-rich fluids (Fig. 3a, b). 

These findings are also illustrated by the evolution of fluid chemistry 
across a chemical profile ranging from an alkali-depleted mantle 
composition to a model slab fluid composition (component F in 
Fig. 3c, d and Extended Data Table 1). This scenario tests the role of 
chemical variability, introduced by metasomatism or intramantle 
differentiation, on fluid chemistry. At 600°C, 2 GPa and minor rock 


Na2O content (wre) of 0-0.1 wt% (in clinopyroxene), that is, alkali 
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Temperature (°C) 


Temperature (°C) 


+ HCOO , Extended Data Table 2) account for larger ApH drop in the 
carbonated basalt compared to the graphite-bearing pelite system. 

c, f, Results obtained for carbonic solvents including 1 molal total 
chlorine (Methods). The effect of chlorine is most pronounced at low-T 
conditions where it is fully ionized, and its abundance in the fluid 
exceeds the ionic strength of Cl-free solutions, I~ 0.1, by ~1 order of 
magnitude. Major metamorphic facies transitions are indicated in a, d, 
and acid-base neutrality (ApH = 0) is indicated by the horizontal 
baseline in each panel. The P-T paths refer to those defined in Fig. 1. 


compositions typical of depleted supra-subduction-zone mantle 
peridotite”, the pronounced increase of jy, 9 (Extended Data Fig. 2) 
is not fully accommodated by enhanced deprotonation of, for example, 
Si(OH)$ to SiO(OH); (Extended Data Fig. 3) and leads to a residual 
ApH increase exceeding 2 log units (Fig. 3c, d). The variations deviate 
significantly from a linear mixing trend between endmember fluid 
compositions, and highlight the limited acid-base buffering capacity 
of ubiquitous ultramafic mineral assemblages characterized by low alkali 
concentrations. Previous models of nitrogen speciation in the mantle” 
may be revised in light of these new constraints. The formation of Na,K- 
bearing assemblages including clinopyroxene, biotite and amphibole 
stabilizes ApH at strongly alkaline values (as previously anticipated 
by ref. 31) ApH > 3-4 over the interval 400-900 °C at 2 GPa; pH 
values then gently decline along a sub-linear mixing trend with further 
addition of component F. Low values of {Usio, in mantle peridotites 
(Extended Data Fig. 2) would act as a geochemical trap for a range of 
silica-bearing polynuclear clusters that carry trace elements in meta- 
somatic fluids derived from the slabs *+°? (Extended Data Fig. 3). The 
formation of mica (at X; = 0.43, Fig. 3d) causes a drop of ApH, anda 
coincident increase of /s:9,; alo, (Extended Data Fig. 2). These 
conditions would promote the high-T formation of silica—aluminate 
complexes*, not considered here for lack of relevant thermochemical 
data, by condensation reactions such as: 

Al(OH); + Si(OH){ = Al(OH)3;0Si(OH); +H,0 (3) 
We anticipate that the formation of these solutes would increase the 
solubility of cations (as required from electroneutrality) and bring 
APH values closer to acid-base neutrality. Further work is needed 
to quantify the efficiency of this mechanism. Experimental strategies 
for pH and electrical transport!! measurements in high-P fluids are 
required. 

Taken together, our results show that acid-base mechanisms 
important for the dissolution, transport and deposition of metals, 
for example pH fluctuations, are not restricted to upper-crustal 
magmatic—hydrothermal environments*°—they are significant in 
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Figure 3 | Metasomatism at the subduction interface. a, C content of a 
fluid at equilibrium with carbonated basalt along cold and intermediate 
P-T paths. b, C content of the fluid along the hot P-T path. C abundance is 
displayed here in moles of C per kg of HzO. Atomic ratio C/Ca along those 
paths is indicated. It is particularly sensitive to the P-T conditions, rising 
very rapidly to above 10° with heating and decompression. Enhancement 
of C solubility with pressure, a peculiar feature confined to P > 2-2.5 GPa 
and T < 500°C (The stable carbonate phase is Fe-Mg carbonate 
“Magnesite’), is due to extensive Ca-carbonate complexing in the fluid. 
Predictions from simple molecular fluid models are indicated along the 
same paths (pale grey area, a). Discrepancies between the models are as 
expected largest at low T and elevated P. Vertical dashed lines indicate 


polar solvents present at the slab interface. Factors controlling these 
fluctuations include (1) mineralogical heterogeneities, (2) the presence 
of minor but soluble components in solid solutions, (3) variations of 
the activity and structure of H,O in mixed-volatile solvents, and (4) 
the speciation of coordinating ligands. In particular, sulfur may form 
abundant dipolar H2S and SO) molecules and various ligands, the dis- 
tribution of which is redox and pH sensitive? and will influence the 
transport of metals such as Au and Cu along the subduction interface. 
Our work places quantitative pH restrictions on the identity and sol- 
ubility of aqueous metal-complexes pertinent to these processes. The 
development and generalization of acid-base formalisms to coexisting 
rock-fluid—melt systems will be necessary at T greater than 750-800 °C 
(Fig. 1a, Methods). 

The present work focused on fundamental chemical characteristics 
of aqueous fluids at upper-mantle conditions, which improves our abil- 
ity to interpret compositional variability in rocks exhumed from the 
Earth’s interior'®?!, Metal—volatile chemical coupling is a consequence 
of mineralogical equilibria and electrostatic interactions in fluids. 
Their nonlinear behaviour and mildly alkaline nature reflect intrinsic 
properties of silicate mineralogies. The sensitivity of fluid pH to soluble 
elements in low abundance in the host rock, such as carbon, alkali and 
halogens, illustrates a feedback between the chemistry of the Earth’s 
atmosphere-ocean system” and the behavior of metals and volatiles 
in subduction zones via the composition of the seawater-altered oceanic 
lithosphere!®. Our results also permit quantification of the electrical 
conductivity of subduction zone fluids". 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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boundaries of metamorphic facies (for example, lawsonite (Lws)-eclogite), 
and indications on C speciation (for example CO), HCO3°). ¢, Variations 
of ApH (colour coded) for T= 400-900 °C and P= 2 GPa obtained by 
mixing peridotite composition with model metasomatic component 

F =Na(K,Ca,Mg,Fe)j/3Ah 5Si7O17.9 (expressed in mole fraction of the 
ultramafic-F oxide mixture, X;) at H2O saturation (H20O sat.). Thick 
black line indicates appearance of white mica (+ Mica) in the assemblage. 
d, Section plot of c at 600°C with indications of mineral assemblages and 
corresponding values of Na,O abundance in the rock (arrows, wys5o in 
wt%). Dashed grey lines indicate linear mixing between endmember fluid 
compositions. 
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METHODS 


Phase equilibria. Phase diagram sections have been computed by forward model- 
ling (Gibbs energy minimization with PerpleX™*) using the 2002 revision of the 
Holland and Powell mineral database*®. Rock compositions are chosen to represent 
model pelite***’, K-free basalt** and peridotite (alkali-free Iherzolite*’) systems 
(Extended Data Table 1) and maintain H,O saturation (ay,9 = 1in pure H2O sys- 
tems, Figs 1, 2a, d) over the entire P-T field for fluid speciation calculation. The 
carbonated basalt is modified from the C-free composition to maintain fluid sat- 
uration (H,O0 ~ 5.8 wt%) with ~2 wt% CO, consistent with average C content of 
the upper basaltic crust*®. Note that a harzburgite peridotite composition (Fe-Mg- 
Al-Si system) would dampen the ionic strength and ApH of the equilibrium fluid 
without significant change to their P-T behaviour. Component F is modified from 
a representative crustal fluid composition from ref. 41. The solid solution models 
used include biotite (Bio), garnet (Grt)°°, chlorite (Chl)*%, staurolite (Sta)*, white 
mica (Mus, Phe or Par)**“°, K-feldspars (Kfs)”, clinopyroxene (Cpx)**, cordierite 
(Crd), chloritoid (Ctd)”, orthopyroxene (Opx)**, plagioclase (P1)*°, amphibole 
(Amp)*), carbonate (Do)*’, magnesite*® , antigorite (Atg)*, olivine (O)** and an 
ideal solution model for talc (T). Minerals considered pure are quartz (Q), 
lawsonite (Law), kyanite (Ky), zoisite (Zo) and sillimanite (Sill). Additional mineral 
abbreviations: albite (Ab), jadeite (Jd), paragonite (Par), brucite (Brc). Two variants 
of the metasomatic model were considered: with (Fig. 3c, d) and without 
Na-phlogopite (aspidolite’) for which thermochemical data are uncertain. In the 
absence of this phase, ApH (where X-=0.01-0.43) increases by ~0.5 unit without 
change to the main features of the diagram. Wet melting curves for basalt and pelite 
are indicative, and drawn from experimental data in refs 53-55, carbonated-basalt 
melting curve is from experimental data in ref. 16. 

P-T paths. Two model top-of-the-slab P-T paths represent cold (Central- 
Honshu referred to as ‘Honshw’) and intermediate (Columbia-Ecuador” referred 
to as ‘Columbia’) thermal regimes. An indicative hot P-T path representative of 
Archaean subduction has been computed with a mantle potential temperature of 
T=1,650°C and parameters from the Cascadia subduction zone”®. We thank P. 
Van Keken for computing and providing the P-T coordinates of this geotherm. 
In combination, these paths are reasonable approximations for a broad range of 
possible geotherms likely to be realized in subduction zones from the Archaean 
to the Phanerozoic?”. 

Solvent definition. Neutrally charged solvent species (for example, H2O, CO2, 
CHy) are described by non-ideal molecular solution models. They are used to 
compute the phase diagram sections (Fig. 1) and the solvent dielectric constant 
(see below). Two distinct models are used for C-saturated C-O-H solvents”® 
(graphitic pelite in Fig. 2b), and for binary H1O-CO)j solvents”? (carbonated 
basalt in Fig. 2b). They define the solvents in which solute speciation is computed. 
This strategy differs from previous approaches™*! in that C solubility predictions 
are consistent with independent constraints from both low-T congruent mineral 
dissolution (calcite) and high-T mixed-volatile carbonate-silicate incongruent 
dissolution experiments” over an extensive range of P, T and water activities. Our 
terminology differs in that congruent and incongruent carbonate dissolution at 
sub-solidus conditions are distinguished here, as opposed to dissolution versus 
decarbonation elsewhere). 

Speciation calculation. At equilibrium the oxide chemical potentials are linked 
to the partial molar Gibbs energy of corresponding dissociated species through 
general hydrolysis equilibria such as: 


Na2O + HO = 2Na‘ + 20H- (4) 


This equation, and equivalent ones for all solutes considered (Extended Data Table 2) 
generates a set of equations of the form presented in Equation (1) (see main text). 
Together with water hydrolysis: 


H,0 = Ht + OH- (5) 


charge balance in the fluid S Zjmj =0 (Z;is the ionic charge , m; the molality) and 


an activity model for dlectralyees, the equations governing fluid speciation are 
closed. 

The chemical potentials of oxide components are obtained by differentiation of 
the extensive Gibbs energy of the system. It is assumed that the solutes do not affect 
the rock composition and mineralogy. This is a reasonable assumption in high-P 
subduction zone environments characterized by elevated rock/fluid ratio of interest 
here. The standard-state Gibbs energy of aqueous species is computed with the 
Helgeson-Kirkham-Flowers equation of state (EoS)*°*** using the Born model, 
extended here to mixed-volatile carbonic solvents”!, adopting the convention 
goe = 0 (see main text). The thermochemical database for aqueous species and 
sources of those data are compiled in Extended Data Table 2. Consistency between 
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mineral and solute thermochemical database is judged adequate for the present 
purpose. Further discussions can be found in refs 21, 35, 69, 70. 

Dissolved solutes that exert a dominant control on charge balance, ionic strength 
and computed pH, are Na*, OH and HSiO, at all conditions for the pelite and 
basalt fluids, and Ca*t, CAOH* and OH” (and AlO;j along the hot geotherm) for 
the peridotite fluids. We estimate the precision on absolute ApH values is +0.2 
ApH (mainly due to uncertainty of thermodynamic data). In absence of experi- 
mental validations, the predictive accuracy is within 1 ApH unit. Relative varia- 
tions and discontinuities are robust. 

Fluid dielectric properties, standard state and activity model for solutes. The die- 
lectric constant (static relative permittivity) of water is from refs 19, 20, 71 and 72, 
used in the Born dielectric continuum model of electrolyte solvation”. Those of the 
mixed-volatile solvents are computed with a method developed in ref. 21. It uses the 
Landau-Lifschitz-Looyenga mixing scheme”””*’° with volume fractions of solvent 
components computed with a reduced density method”. Input solvent density at 
critical and target P, T conditions are derived from the molecular fluid EoS (see above). 

The energetics of the solutes in going from pure H2O to a mixed-volatile 
solvent characterized by €;ix, is incorporated to the standard-state Gibbs energy of 
the solute’”’®, that is, it affects the equilibrium constants of homogeneous oxide- 
solutes equilibria in solution. Solute standard-state is unit activity of lmol of 
solute in 1kg of solvent, referenced to infinite dilution. Aqueous species molal 
concentration is given in mol per kg of solvent. 

Non-ideality for electrolytes follows the Davies extension of the Debye—Hiickel law 
in its original 1938 formulation”. The ionic strength of the solution (J) is linked to the 


Azz /2 
activity coefficient (7j) of the jth anion or cation by log y= renICA = 0.2251 , 
1+ 


where I= 5 SoZ m, A= 1.82483x10°p!/?(EmixT)*/?, my refers to the mola- 
j 


lity of the ion, A is the Debye-Hiickel parameter, and pix and €mix are respectively the 
density and dielectric constant of the solvent. Using a coefficient C= 0.3 (ref. 79) 
instead of C=0.2 (ref. 80) in the empirical term ‘CZ Fi ’ introduced no noticeable 
change to the results presented. The activity coefficient for neutral solutes is 1. The 
concentration of chlorine (for example, Fig. 2c, f) is set by fixing the total chlorine 
(Cl) content of the fluid (Cl~, HCI, NaCl -: KCI solutes) to 3°Cl=1 molal. Further 
technical details can be found in ref. 21. 

C flux estimates along cold and hot paths. Although subduction zone elemental 
budgets are sensitive to lithosphere hydration, overall length of subduction 
trenches, composition of subducting materials, geotherms and thus fluid fluxes, 
provisional estimates can be derived combining element solubility (this work) with 
fore-arc (<100km depth, ref 81) and sub-arc (100 to 150 km depth) H2O flux 
computed from ref. 81, assuming 2% HO in the upper 4km of the oceanic mantle 
(Extended Data Table 3). Focusing on carbon, assuming a total length of 
3.85 x 10’ m of subduction trenches, a cold subduction (model based on the cen- 
tral Honshu system) results in a global annual sub-solidus C flux, combining 


forearc and sub-arc fluxes, of 6, = 1047 Mt yr_!. This corresponds to a net 


equivalent O flux beyond subarc depth of about 105 +50 Mt yr“! (or redox budget 
flux (see ref 6) of 14+7 Tmol yr“! assuming C® (elemental C) is the reference C 
oxidation state in the deep upper mantle), assuming a subduction C input of 
40-66 Mt yr! (refs 61). Conversely, along the hot geotherm (Figs 1, 3b), the forearc 
flux alone is 6, = 407 Mt yr“! that is within uncertainty of present-day sub- 
duction C input in ref. 61. This budget contrasts with cold regimes where up to 
85% of the C input is retained in slabs, in agreement with conclusions from refs 
82-85, and in partial contrast with challenging views®!. Hot and intermediate paths 
(Columbia, Fig. 1a) typical of the Precambrian and rare modern settings are 
the only scenarios where substantial, possibly quantitative, forearc decarbonation may 
occur. In the latter intermediate scenario, quantitative C loss may occur, yet not by 
sub-solidus dissolution, but by carbonate melting'*****, Slabs retaining ~40% of their 
initial C budget would cross the carbonated-basalt melting curve at sub-arc depth 
(Fig. la, see refs 16, 83, 86). Note that this temporal trend is consistent with previous 
interpretation of kimberlite-diamond isotopic signatures reflecting little recycled 
components in the mantle*!: The latter isotopic signatures may record C recycling 
under dominantly hot Archean and proterozoic subduction thermal regimes, and 
may not be representative of Phanerozoic and present-day subduction zones. 
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Extended Data Figure 1 | Properties of water at geological conditions. 
The relative permittivity” (¢,) and density®” (p) of pure water as functions of 
pressure and temperature. Adapted from ref. 21. 
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Extended Data Figure 2 | Oxide chemical potential across a peridotite-crust 
interface. Shown are profiles of oxide chemical potentials (i) in the metasomatic 
model (Fig. 3c, d) relative to that for component F (177) at 600°C and 2 GPa. 
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Extended Data Figure 3 | Relative activity of selected neutral species 
across a serpentinite-crust interface. Shown are relative activities 
fluid rock 
aj 1 F 
i), ewe 
“ / pr PT 
NaAlsi,O? and Nay$i408, as well as Na* and HSiO;~ / SiO}, in the fluid, at 
600°C and 2 GPa (see Fig. 3c, d). 


of neutral polynuclear clusters HAISiO$, 
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Extended Data Table 1 | Rock compositions used for phase equilibria computations 


SiOz Al,03 Na2O K,0 MgO CaO FeO H20 COz 


Pelite’ 65.6 18.17 1.89 3.87 2.87 1.19 6.37 sat.” 

Basalt" 51.6 16 2.6 0 7.7 11.3 10.8 sat. 

Basalt+C" 47.5 14.7 2.4 0) 7.1 10.4 9.9 5.85 2.02 
Peridotite” 46.11 4.52 ) 0) 39.05 3.26 7.04 sat. 


Fcomponent” 70.11 12.74 5.16 2.61 2.24 3.11 3.99 sat. 


Composition from ref. 36, the C-saturated pelite with fluid atomic ratio O/H =0.5 has a similar oxide composition. 
’Modified from ref. 38. 

“Modified from ref. 38. 

Composition from ref. 39. 

“Composition from ref. 41. 

‘sat., saturated. 
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Extended Data Table 2 | Thermodynamic data source for solutes used in this study 


Inorganic species C species Cl species 
SiO.” a KOH? d MgsiO;, A HCO; a Hcl’ a 
Si,02° a K a Fe* co;” a Nacl° a 
HSiO3 b KAIO.° e OH” a HCOO” a Kcl° a 
HAIO2° a Ca(OH)" a NaCO; a cr a 
Alo a Ca”* a NaHCO;° a 

Na(OH)° a CaHSiO;, ff CaHCco;* a 

Na* a CaSiO;° g MgHCo;* a 

NaAlo, sc Mg(OH)’ =a Mgco;° a 

NaHSiO;; a MgHSiO;° a CaCo;” a 


"Data as reported in the Deep Earth Water (DEW) database, (ref. 20). 

‘Data as reported in the DEW database, (ref. 20). 

“From ref. 88. 

As reported in ref. 20. 

©From ref. 89. 

‘Modified from ref. 90 to obtain a; parameter (HKF volumetric coefficient) consistent with the sum of the revised a; of Ca2* and HSiO3~. 
Estimates from data at 25°C for CaSiO3° from cement phase solubility, and wollastonite solubility from ref. 91 (compare ref. 92). 
hAssumed to have the same logK of formation as CaSi03°, from ref. 92. 
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Extended Data Table 3 | Estimates of global annual C fluxes 


C concentration 


H20 flux’ (Mt/yr) (mol/kg H20) C flux (Mt/yr) 
forearc  subarc? forearc subarc forearc subarc total 
Carelpetshi 277 177 0.3-3° 1-3° 544 442 1047 
Columbia:Fcuador 420 454 1-10 7-10" 28423 4648 74431) 
Mexico (‘Archean’) 474 235 4-10° = 7-10" 40417 2444 64421 
331 130 0.3-6" 3-6" 13411) 742 20414 


Global average 


Shown are estimates of global annual C fluxes derived from C solubility data (this study) and H20 fluxes from subducting slabs (ref. 81). 
*The global annual H20 flux for each subduction locality is obtained by multiplication of the annual H20 flux per metre of trench by the entire length of the 
present subduction trenches, that is, 38.5 x 10m (ref. 81). The global average is obtained from ref. 81. C concentrations for the global (present-day) average 


are indicative, intermediate between the Honshu and Columbia endmembers scenario. 
‘Subarc H20 flux are released from slabs between 100 and 150km depth. 


‘The value of 0.3 mol C per kg H20 is at ~500°C (Fig. 3a), and 3 mol C per kg H20 is a conservative estimate at ~650-700°C and ~3 GPa (Fig. 3a). 
‘The concentration range 1-3 mol C per kg H20 accounts for the decrease in C solubility with P increase along the high-T section of the P-T path. 
©The value of 4mol C per kg H20 is at 600°C, Fig. 3b, increasing to ~10 mol C per kg H20 at ~700-750°C. 
‘The H20 fluxes for the model Archaean scenario are those computed from the Mexico geotherm (ref. 81). 
“The solidus may be crossed along the path. Rock partial melting may significantly affect the values reported. 
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Variability in plant nutrients reduces insect 


herbivore performance 


William C. Wetzel'?, Heather M. Kharouba?’, Moria Robinson’, Marcel Holyoak* & Richard Karban® 


The performance and population dynamics of insect herbivores 
depend on the nutritive and defensive traits of their host plants’. 
The literature on plant-herbivore interactions focuses on plant 
trait means”, but recent studies showing the importance of plant 
genetic diversity for herbivores suggest that plant trait variance may 
be equally important**®. The consequences of plant trait variance 
for herbivore performance, however, have been largely overlooked. 
Here we report an extensive assessment of the effects of within- 
population plant trait variance on herbivore performance using 457 
performance datasets from 53 species of insect herbivores. We found 
that variance in plant nutritive traits substantially reduces mean 
herbivore performance via nonlinear averaging of performance 
relationships that were overwhelmingly concave-down. In contrast, 
relationships between herbivore performance and plant defense 
levels were typically linear, such that plant defense variance does 
not affect herbivore performance via nonlinear averaging. Our 
results demonstrate that plants contribute to the suppression of 
herbivore populations by having variable nutrient levels, not just 
by having low average quality as is typically thought. We propose 
that this phenomenon could play a key role in the suppression 
of herbivore populations in natural systems, and that increased 
nutrient heterogeneity within agricultural crops could contribute 
to the sustainable control of insect pests in agroecosystems. 

Decades of research have established the importance of plant 
nutritive and defensive traits for herbivore performanceand popula- 
tion dynamics’. Recent studies, showing that plant genetic diversity 
influences herbivore community patterns, suggest that plants influ- 
ence herbivores not just through average trait values but also through 
variance in trait values™®. The literature on plant defenses and herbi- 
vore nutritional ecology, however, focusesion mean relationships and 
mostly ignores the consequences of trait variance’. This is an over- 
sight because intraspecific plant trait variance pervades natural systems, 
from among tissues within individuals to among individuals within 
populations‘. In modérn agroecosystems, however, plant functional 
diversity has been replaced by extensive homogeneous monocultures of 
single crop varieties or genotypes’. How the loss of trait diversity influ- 
ences higherstrophic levels and ecosystems services like pest control is 
unexplored relative to how much is known about the consequences 
of genetic diversity*?. Elucidation of the direct effects of variability 
in plant defensive or nutritive traits on herbivore performance would 
inform management of agroecosystems—perhaps revealing new ways 
to useicrop heterogeneity for sustainable pest management—and 
advance our fundamental understanding of plant-insect interactions. 
Here we test for general patterns in the effects of plant trait variance 
on herbivore performance using 457 datasets relating plant traits to 
herbivore growth and survival for 53 species of phytophagous insects 
from seven orders. 

Plant variance could influence herbivores in several ways, includ- 
ing reducing the opportunity for herbivore populations to adapt 


evolutionarily to plant defenses'®. We focused on the ecological effects 
that occur via nonlinear averaging, a general phenomenon that poten- 
tially applies to all species and has been used successfully to under- 
stand diverse biological phenomenat)!4 In'this context, nonlinear 
averaging—also known as Jensen's inequality'’—allows us to predict 
the effects of plant variance on meamherbivore performance using the 
curvature of the relationship between plant trait values and herbivore 
performance (Fig. 1). Ifthe function relating herbivore performance to 
a plant trait is concave-down (decelerating), then herbivore populations 
experiencing variance in that trait will have lower mean performance 
relative to herbivores experiencing constant levels of the trait at the 
same trait méan’. Injcontrast, variance enhances mean performance 
when herbivore performance functions are concave-up (accelerating). 
When performance functions are linear, plant trait variance has no 
effect. Three reviews have attempted to generalize the effects of plant 
trait variance on herbivore performance by visually assessing and 
categorizing the curvature of herbivore performance functions in pub- 
lished studies**"!. They came to contradictory conclusions, perhaps 
because they had small sample sizes (< 12 studies) and lacked objective 
methods for quantifying curvature. 

We found 76 papers published between 1968 and 2014 that allowed 
us to estimate herbivore performance functions (Supplementary 
Tables 1 and 2; Extended Data Fig. 1). These papers reported growth 
or survival of herbivores at > 4 values of a plant trait. Our search only 
included studies that directly manipulated trait values in a laboratory 
setting (e.g., via artificial diet, or applying compounds to plant surfaces) 
to avoid inclusion of spurious correlations. We estimated a performance 
function for each dataset with a cubic spline (e.g., Fig. 2, A and B). 
The spline from each dataset allowed us to quantify the effect of trait 
variance on herbivore performance, specifically the difference in per- 
formance in the presence and absence of plant trait variance (Jensen's 
effect). We predicted performance in the absence of variance using the 
value of the performance function at the mean of the plant trait levels 
in the dataset. We predicted performance in the presence of variance 
using the mean of the values of the performance function at each of the 
trait levels (Extended Data Fig. 2). This approach assumes the original 
authors chose trait levels reflective of trait distributions encountered by 
herbivore populations in nature. Indeed, many studies reported field 
data justifying their range of values. In nature the magnitude—but not 
the sign—of Jensen’s effect will depend on the plant trait distribution 
and resource selection behavior; however, even herbivore populations 
with highly selective resource behavior will experience plant variance 
due to costs and limits of discrimination and intraspecific competition’. 
We tested sensitivity of our conclusions to the shape of the trait distri- 
bution by repeating the entire analysis assuming uniform and Gaussian 
distributions. 

First, we asked how the consequences of plant trait variance for 
herbivore performance differed for plant defensive traits (e.g., concen- 
trations of toxic secondary metabolites) versus plant nutritive traits 
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(e.g., protein concentration). Physiological theory predicts the rela- 
tionship between nutritive traits and herbivore performance should be 
concave-down!*!’ (Fig. 2D), but makes less clear predictions about the 
relationship between defensive traits and performance!®"9 (Fig. 2C). 
Despite the centrality of this question to insect physiology and broader 
theory in plant-insect interactions, we know of no other quantitative 
analyses of shapes of empirical performance curves. 

We found that relationships between plant nutrients and herbivore 
growth and survival were consistently concave down, whereas rela- 
tionships between plant defenses and herbivore growth and survival 
were close to linear on average. The curvature of nutrient-performance 
relationships led the experimentally- generated variability in nutrients 
to reduce mean herbivore growth by half a standard deviation rel- 
ative to plants with a similar mean nutrient value but zero nutrient 
variance (Jensen's effect [95% CI] = -0.49 [-0.62, -0.35]) (Fig. 3A). 
The mean effect of nutrient variance was more than half a standard 
deviation more negative (-0.55 [-0.66, -0.44]) than the mean effect of 
defense variance, which was near zero (0.063 [-0.055, 0.18]; x1 =92.6, 
P < 0.0001). This difference suggests there is less influence of variance 
in plant defenses on herbivore performance compared to variance in 
nutritional content. Mirroring the results for growth variables, studies 
of herbivore survival had negative Jensen's effects for nutritive traits 
(-1.41 [-2.29, -0.53]) and effects near zero (linear) for defensive traits 
(0.28 [-0.24, 0.81]; x7; = 19.2, P< 0.0001) (Fig. 3B). 

These results indicate that there are consistent constraints on herbivore 
physiology that lead to concave-down nutrient-performance relation- 
ships (Fig. 3, D and F) and depressed growth and survival in the face 
of plant nutrient variability (Fig. 3 A and B). This finding supports the 
theoretical prediction that performance for most consumers increases 
with nutrients but then plateaus due to diminishing returns, or even 
declines at high nutrient levels due to nutrient toxicity!*!”. In contrasts 
the generally linear declines in performance with increasing levels of 
plant defense indicate that defense variability has little effect on herbis 
vores via nonlinear averaging (Fig. 3, C and E). This finding contradicts 
recent predictions about the ubiquity of hormesis—beneficial, stimu- 
latory effects of low doses of toxins'’—or indicates that hormesis may 
occur only at lower doses than were tested by studies in our sample. 
It also indicates that defense thresholds, above which,herbiyore per- 
formance declines precipitously, are uncommon, which implies that 
increasing plant investment in a given defense will consistently decrease 
herbivore performance. 

Second, we asked how these relationships differed among herbivore 
species with different resource-selection behaviors. Because the conse- 
quences of plant variance for anherbivore population depend not only 
on the shape of the herbivore performance function but also on the 
amount of plant variance the population encounters, we hypothesized 
that curve shapes might be different for herbivore species with differ- 
ent mobility and host breath—two traits that influence how herbivore 
species encounter plant variance. We found, however, that mobility 
did not influence the patterns described above for growth (nutrients: 
x7; = 0.269 P=0.61; defenses: x7; = 0.10, P=0.75) and survival 
(defenses: x75= 0.45;'P = 0.80) (Extended Data Fig. 3). Herbivore host 
breadth also wasnot a significant predictor of Jensen's effect for growth 
(nutrients: 7; = 1.04, P=0.31; defenses: x7; =0.18, P=0.67) or survival 
(defensés:7| = 1.25, P=0.26) (Extended Data Fig. 4). Sample sizes 
were too small to test nutrient effects on survival. These results suggest 
that the shapes of nutrient and defense performance curves are funda- 
mental constraints regardless of insect life history traits. 

Our findings indicate that plants may contribute to the suppression 
of herbivore populations not only by having low average quality but also 
by having heterogeneity in nutrient levels. Concave-down nutrient- 
performance functions may be an important link between herbi- 
vore physiology and the negative relationships commonly observed 
between plant diversity and herbivore density at ecosystem scales”’. 
A key implication is that agroecosystems may experience outbreaks 
of herbivores because herbivore performance is elevated by artificially 
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low plant heterogeneity, due to landscape simplification, reduced plant 
species diversity, and crops bred to minimize variation. Increasing 
heterogeneity in plant nutrients in agroecosystems may be a key step 
towards the sustainable control of insect pests. Plant nutrient het- 
erogeneity could be increased by planting greater numbers of crop 
varieties”', by increasing genetic diversity within crop varieties, or by 
breeding varieties with elevated constitutive or induced nutrient var- 
iance within plant parts attacked by insect pests. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references,uniquerto 
these sections appear only in the online paper. 
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Constant plants 
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Figure 1 | An illustration of Jensen’s inequality. The shape of the 
relationship between herbivore performance and a plant trait influences 
the consequences of trait variance for mean performance (p) via nonlinear 
averaging or Jensen’s inequality. The left column (A,C,E) represents plant 
populations where all plants have one trait value and no variance (plants at 
one location on x-axis). The right column (B,D,F) represents populations 
with trait variance, where half of the plants have a high trait value and half 
have a low value (two plants on x-axis). The trait mean, however, is the 
same in the constant and variable populations. With any linear function, 
trait value changes result in proportional changes in performance. Mean 
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Variable plants 


herbivore performance is therefore equal in the absence or presence of 
variance (no Jensen's effect; compare A,B). With nonlinear performance 
functions (C-F), however, trait value changes do not result in proportional 
performance changes, and mean performance will differ in the absence 
and presence of trait variance. When the relationship is concave-down, 
mean performance will be lower in the presence of trait variance (negative 
Jensen's effect; compare C,D). When the relationship is concave-up, mean 
performance will be higher in the presence of trait variance (positive 
Jensen's effect; compare E,F). Doubled-headed arrows show differences in 
mean performance with and without plant variance. 
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Figure 2 | Empirical and theoretical performance curves. Growth 

data from empirical/studies and fitted growth curves for (A) Heliothis 
virescens (Lepidoptera: Noctuidae) across a range of levels of various 
plant secondary metabolites and (B) Helicoverpa zea (Lepidoptera: 
Noctuidae) across a range of levels of various plant nutrients. Data are 
standardized to zero mean and unit standard deviation. Red curves are 
significantly concave-down, blue ones are concave-up, and pink ones are 
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linear. Different symbols denote different experiments. (C,D) Theoretical 
predictions for relationships between plant traits and herbivore 
performance. For simplicity, (D) shows one curve with an intermediate 
maximum, but curves that asymptote at high nutrients are also possible. 
Both shapes are concave-down and would result in negative effects of 
nutrient variability. 
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on herbivore growth (A) and survival (B). Each point is one herbivore are significantly concave-down (- Jensen's effect). Pink curves are linear 
species (N= 53), jittered for visibility. Diamonds and error bars show means _ (0 Jensen’s effect). Blue curves are significantly concave-up (+ Jensen's 
and 95% confidence intervals. Growth effects are standard deviations. effect). Curves are standardized to be on the same scale. 


Survival effects are log odds-ratios. (C-F) The empirically-estimated 
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METHODS 


Literature search. We located papers with data relating plant traits to herbivore 
performance by conducting keyword searches in Web of Science up to September 
2014, collecting studies cited in relevant reviews,*!!” and searching papers known 
to the authors. Studies were included in our analysis if they met the following 
criteria: 1) a plant trait was experimentally manipulated and directly related to 
insect growth or survival; 2) at least four levels of the plant trait were established; 
3) for all continuous response variables, some estimate of variance in herbivore 
performance was provided (e.g., SE) along with the mean and sample size at each 
treatment level; 4) for binomial survival response variables, studies provided an 
initial number of individuals at each treatment level and a count, proportion, or 
percentage that survived; and 5) data on traits and herbivore performance could be 
retrieved from a table, figure, text, or supplement”’. (See Supplementary Methods 
for additional methods.) 

Data collection. From each suitable paper, we recorded species, plant traits, and 
herbivore growth and survival variables measured. We categorized plant traits as 
defenses or nutrients based on the original studies. (See Extended Data Fig. 1 for 
a summary of the database, including a list of all plant traits; see Supplementary 
Tables 1 and 2 for a list of the papers that met our criteria and a list of the herbivore 
species). For growth responses, we collected the mean and variability (e.g., SE) of 
the herbivore response and sample size at each level of the plant trait. For survival 
responses, we collected the initial number of herbivores and the count, proportion, 
or percent that survived at each level of the plant trait. 

Effect calculation. We used a bootstrapping approach to calculate a distribution 
of Jensen’s effects for each empirical observation (Extended Data Fig. 2). Having a 
distribution of effects for each observation allowed us to estimate variance for each 
observation and quantify our uncertainty for each estimate. For survival responses, 
we used nonparametric bootstrapping. We resampled each survival dataset with 
replacement 10,000 times. We fit cubic splines to each bootstrap dataset using 
the mgcv package in R 3.2.4** and calculated a Jensen’ effect from each spline. 
We calculated Jensen's effect as the log odds ratio of the mean of the predicted 
survival probabilities at each plant trait level and the predicted survival probability 
at the mean plant trait level. This measure, the log odds ratio, is widely used to 
express effect sizes in meta-analyses of response probabilities””. The log odds ratio 
is beneficial because it puts survival on the logit scale, which accurately represents 
survival as a multiplicative process. 

For growth data, which were typically reported as means and standard errors 
at each plant trait level instead of raw data, we used parametric bootstrapping 
(Extended Data Fig. 2). We parameterized a log-normal distributiomfor herbivore 
growth at each level of the plant trait using the reported herbivore performance 
means and standard errors. We then drew values from each distribution until 
the length of our bootstrap response vectors equaled the sample sizes reported 
at each level of the plant trait. We repeated this procedure to/obtain, 10,000 boot- 
strap datasets. Then we followed the curve fittingmethodsdeseribed above but 
first log-transformed the responses and then used a Gaussian error distribution. 
We calculated Jensen’s effect for growth by subtracting the predicted herbi- 
vore performance for the mean level of the plant. trait (the expected herbivore 
performance in the absence of plant trait variance) from the mean of the predicted 
herbivore performances at each plant trait level (the expected herbivore perfor- 
mance accounting for trait variance and nonlinear averaging). We standardized this 
difference by dividing it by the standard deviation of the bootstrapped herbivore 
performances to enable comparison across studies. This measure thus expresses 
the effect of nonlinear averaging insterms of standard deviations of herbivore 
performance and isjanalogous to/Hedges’ d, one of the most widely used meta- 
analysis effect sizes””. (See Supplementary Discussion: R Script for the computer 
scripts used in'this analysis.) 

Effect of Trait.Distribution. Our approach assumes that the values of the plant 
traits tested’by the authors of each study reflect the natural distribution of trait 
values. We believe this is justified because most authors stated that they chose trait 


levels representative of those in nature or provided data showing correspondence 
between natural trait means and variances and experimentally chosen trait levels. 
Regardless, we also repeated the entire analysis assuming two different trait 
distributions: a uniform distribution between the minimum and maximum of 
the plant traits tested by the authors, and a Gaussian distribution with a mean 
equal to the midpoint of the author-chosen doses and a standard deviation that 
aligned the maximum author-chosen dose with 0.975 of the Gaussian cumu- 
lative probability function. The results were similar for each of the three trait 
distributions we tested, so in the main text we present the results based on the 
analysis that assumes a trait distribution defined by author-chosen trait levels. 
(See Supplementary Discussion for the results of the analyses with uniform and 
Gaussian distributions). 

Statistical modeling. We tested our hypotheses using linear mixed effects models 
in the metafor package”* in R 3.2.4”. The response variable was,the mean of the 
Jensen’s effect distribution from each empirical dataset; We examined differences 
between variability in plant defensive and nutritive traits/by fitting models with 
plant trait type (nutrient or defense) as an independent variable. We examined 
the effects of herbivore mobility and host-breadth by including mobility and host- 
breadth as independent variables. We tested the significance of trait type, mobility, 
and host-breadth as predictors of Jensen's effectsyusing an omnibus test based 
ona chi-squared distribution. We used.random intercepts for herbivore family 
and genus to account for potential correlations due to shared evolutionary history 
among genera within a familyand species within a genus. We used this approach 
because a reliable phylogenetic tree does not exist for this diverse group of insects. 
We used an additional random intercept to account for the non-independence of 
multiple Jensen's effects measured on the same species, and we included a ran- 
dom effect for each,obseryvationfollowing the standard practice of random effects 
meta-analysis*?. Finally, thesampling variance of each observation was set equal 
to the estimated variance of the distribution of Jensen's effects generated by our 
bootstrapping procedure. We explored differences in Jensen’s effect among insect 
orders and found them to be minimal (Supplementary Discussion: Results by 
Insect Order). 

Publication, bias. Publication bias was unlikely to be an issue in our analysis 
because we used data for a different goal than did authors of the original 
studies, none of whom estimated Jensen’s effects or curvature. Regardless, we 
explored the potential for publication bias graphically and found no evidence 
suggesting particular curve shapes were more likely to be published than 
others (Extended Data Figs. 5 and 6; Supplementary Discussion: Analysis of 
Publication Bias). 

Data availability. Data used in the analysis have been deposited at http://dx.doi. 
org/10.6084/m9.figshare.3792117. 
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Extended Data Figure 1 | Graphical summary of database. Number of herbivore species per (A) order, (B) mobility of feeding stage, and (C) host breadth. 
Number of herbivore performance curves per (D) trait type, (E) defense class, (F) nutrient class, (G) date of publication, and (H) study sample size. 
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Extended Data Figure 2 | Visual representation of quantitative methods. This diagram summarizes the bootstrapping algorithm used to calculate 
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survival see Methods and Supplementary Methods. 
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Extended Data Figure 3 | Jensen’s effects by plant trait type (defenses readily move among neighboring host plants but do not typically move 
and nutrients) and mobility of the feeding stage. Defense variance had between patches. Species within the ‘region’ category commonly move 
mean effects near zero and nutrient variability had generally negative among host plant patches. Each point is one herbivore species, jittered 
effects regardless of the mobility of the feeding stage of the herbivore for visibility. Diamonds and error bars show means and 95% confidence 
species. Species in the ‘plant’ category move within plant individuals but intervals. See Supplementary Materials and Methods for more detail. 
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Extended Data Figure 4 | Jensen’s effects by plant trait type (defenses Polyphagous species (‘poly’) feed on plant species across two or more 
and nutrients) and host-breadth. Defense variance had mean effects near _ plant families. Each point is one herbivore species, jittered for visibility. 
zero and nutrient variability had generally negative effects regardless of the | Diamonds and error bars show means and 95% confidence intervals. 
host breadth of the herbivore species. Oligophagous species (‘oligo’) feed See Supplementary Materials and Methods for more detail. 

on plant species in multiple genera but are restricted to one plant family. 
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Extended Data Figure 5 | Funnel plots for (A) growth and (B) survival. The lack of a relationship between the sample size of a study and its Jensen’s 
effect suggests publication bias did not have a major influence on the results. Dashed line shows zero. Solid lines show linear regressions for growth 
(Fi,248 = 0.23, P= 0.63, R? = 0.0) and survival (F203 = 1.04, P=0.31, R? =0.0). 
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Extended Data Figure 6 | Jensen’s effect for each observation by the year of publication for (A) growth and (B) suryival. The lack of temporal trends 
in Jensen’s effects suggests publication bias did not play a major role. 
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Neuromodulators signal through astrocytes to alter 
neural circuit activity and behaviour 


Zhiguo Mal, Tobias Stork!, Dwight E. Bergles? & Marc R. Freeman} 


Astrocytes associate with synapses throughout the brain and express 
receptors for neurotransmitters that can increase intracellular 
calcium (Ca”*)!"3. Astrocytic Ca”* signalling has been proposed 
to modulate neural circuit activity’, but the pathways that regulate 
these events are poorly defined and in vivo evidence linking changes 
in astrocyte Ca’* levels to alterations in neurotransmission or 
behaviour is limited. Here we show that Drosophila astrocytes 
exhibit activity-regulated Ca" signalling in vivo. Tyramine 
and octopamine released from neurons expressing tyrosine 
decarboxylase 2 (Tdc2) signal directly to astrocytes to stimulate 
Ca?* increases through the octopamine/tyramine receptor 
(Oct-TyrR) and the transient receptor potential (TRP) channel 
Water witch (Wtrw), and astrocytes in turn modulate downstream 
dopaminergic neurons. Application of tyramine or octopamine to 
live preparations silenced dopaminergic neurons and this inhibition 
required astrocytic Oct-TyrR and Wtrw. Increasing astrocyte Ca?* 
signalling was sufficient to silence dopaminergic neuron activity, 
which was mediated by astrocyte endocytic function and adenosine 
receptors. Selective disruption of Oct-TyrR or Wtrw expression in 
astrocytes blocked astrocytic Ca”* signalling and profoundly altered 
olfactory-driven chemotaxis and touch-induced startle responses. 
Our work identifies Oct-TyrR and Wtrw as key components of the 
astrocytic Ca** signalling machinery, provides direct evidence that 
octopamine- and tyramine-based neuromodulation can be mediated 
by astrocytes, and demonstrates that astrocytes are essential for 
multiple sensory-driven behaviours in Drosophila. 

Astrocytes are intimately associated with brain synapses and posi- 
tioned to broadly regulate synaptic activity. It has been widely proposed 
that astrocytes modulate neural circuits and behaviour? ’, although 
in vivo evidence that astrocytes are directly activated by neurotrans- 
mission and signal back to neurons to modulate their output remains 
elusive. Astrocytes exhibit dynamic fluctuations in intracellular Ca’* 
in vitro*? and in vivo'”", suggesting that Ca** signalling might be a 
useful measure of astrocytic activity. Despite decades of studies on 
astrocytic Ca”* transients, the signalling pathways that control these 
transients remain poorly defined, and their in vivo relevance remains 
controversial. If astrocytes actively participate in information processing 
in circuits, it is imperative that we characterize the mechanisms 
involved as they would represent a potentially widespread mechanism 
for controlling brain function. 

We performed an RNA interference (RNAi)-based screen in 
Drosophila in which we individually knocked down around 500 Ca?t 
signalling-related genes selectively in astrocytes using the astrocyte- 
specific Gal4 driver line alrm-Gal4 (ref. 12), and assayed larval olfactory- 
driven chemotaxis (Extended Data Fig. 1a). We found that astrocyte 
knockdown of the TRP channel Wtrw" led to a significant (~50%; 
P<0.01) decrease in larval chemotaxis towards iso-amyl acetate (IAA) 
(Fig. 1a). Similar results were found with a second, non-overlapping 
RNAi construct (Fig. 1a); the effect of Wtrw knockdown was specific 


to glia, as shown by insensitivity to blockade of Gal4/UAS in neurons 
expressing elav-Gal80 (Extended Data Fig. le). Larvae bearing the null 
allele wtrw™ were equally defective in chemotaxis, but this behaviour 
was rescued by re-expression of wtrw only in astrocytes (Fig. 1a). 
We also found that blockade of Gal4/UAS-driven wtrw®S™ with 
tsh-Gal80 in the ventral nerve cord (VNC) resulted in normal chem- 
otaxis responses (Extended Data Fig. 1d, f), revealing a critical role for 
VNC astrocytes in this behaviour, although this does not exclude an 
additional role for brain astrocytes. Chemotaxis defects did not result 
from simple alterations in motility, as wtrw®N4i larvae exhibited normal 
locomotion (Extended Data Fig. 1g) and light avoidance responses 
(Extended Data Fig. 1c, h). We conclude that Wtrw is required in astro- 
cytes for normal larval olfactory-driven behaviour. 

Ca’* transients are induced in astrocytes in awake behaving mice during 
periods of elevated arousal'*-'°. We therefore used a gentle anterior 
touch assay’” to investigate the role of astrocyte Ca?” signalling in 
larval startle-induced behaviours. Crawling larvae touched anteriorly 
with a hair responded by pausing and continuing forwards (type I 
response), or by moving backwards and executing an escape response 
(type II response; Extended Data Fig. 1b). Wild-type larvae and controls 
without RNA knockdown exhibited roughly equal frequencies of type I 
and type II responses, but the expression of wtrw®S“ in astrocytes 
resulted in a marked alteration in behaviour: about 80% of larvae exhib- 
ited type I responses. This phenotype was mimicked by wtrw™ mutants 
(Fig. 1b) and was independent of neuronal Wtrw expression (Extended 
Data Fig. 1i). These data indicate that astrocyte-expressed Wtrw also 
modulates startle-induced behavioural changes in Drosophila larvae. 

To study in vivo Ca** signalling, we developed a semi-dissected 
preparation to image the larval CNS. We used the fluorescent calcium 
sensor GCaMP6s to image astrocyte cytosolic Ca”* changes (UAS- 
GCaMPés) and mCherry (UAS-mCherry) as a reference for astrocyte 
position, and imaged dorsal astrocyte cell bodies in the VNC. Astrocyte 
cell bodies exhibited coordinated, population-wide slow oscilla- 
tions (termed somatic Ca** transients; Extended Data Fig. 1j-l and 
Supplementary Video 1). Somatic Ca** transients occurred approxi- 
mately every 2 min and exhibited an average change in AF/F of about 
11% (Extended Data Fig. 1m). Notably, blocking neuronal activity with 
tetrodotoxin (TTX) suppressed transients by approximately 60%, as 
did application of the broad Ca?* channel blocker lanthanum chloride 
(LaCl3; Extended Data Fig. 1n). Similar astrocytic Ca”* transients were 
observed when we imaged intact immobilized larvae (Extended Data 
Fig. 1o), indicating that our dissected preparation preserves in vivo 
patterns of astrocyte activity. 

TRP channels regulate Ca** levels in astrocytes!®, so we reasoned 
that Wtrw might drive Ca”* signalling in Drosophila astrocytes. Wild- 
type larvae exhibited 8-9 rhythmic oscillations in somatic Ca? tran- 
sients over 15 min. By contrast, astrocyte-specific wtrw®™ led to a 
roughly 50% decrease in somatic astrocyte Ca** transients, which 
was also observed in the wtrw* mutant (Fig. 1c). Bath application of 
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Figure 1 | Larval chemotaxis and startle-induced responses require 
the astrocyte-expressed TRP channel Wtrw. a, Chemotaxis assay 

(n= 12). b, Gentle touch assay (n = 30). c, Pseudocoloured maximum 
intensity projections of 15-min movies, averaged traces of 16 individual 
astrocytes and quantifications of the frequency of somatic Ca’* transients 
(n= 10, 160 cells total). Scale bar, 501m. d, Responses of astrocytes to 
neurotransmitters and neuromodulators in the presence of TTX (n=6, 
96 cells total). *P < 0.05, **P < 0.01; NS, not significant; error bars, s.e.m. 
a, Wilcoxon and Mann-Whitney tests followed by Bonferroni-Holm 

post hoc test; b-d, One-way ANOVA followed by Tukey’s post hoc test. 
Ach, acetylcholine; GABA, y-aminobutyric acid; Glu, glutamate. 


acetylcholine, glutamate, or \-aminobutyric acid in the presence of 
TTX did not elicit a change in Ca’* levels in astrocytes. By contrast, 
application of tyramine (Tyr) or octopamine (Oct), the invertebrate 
analogues of norepinephrine, which has been shown to induce Ca”* 
transients in mammalian astrocytes'*'>!°”°, potently elevated somatic 
Ca?" in astrocytes (Fig. 1d), indicating that astrocyte somatic Ca** 
signalling is regulated by these neuromodulators. 

Tdc2* neurons are the only known source of Tyr and Oct in the larval 
VNC". To investigate the relationship of these neurons with astrocytes 
we expressed the red-shifted Ca?* indicator R-GECO1 in Tdc2* neu- 
rons (using tdc2-LexA/LexAop-R-GECO1) and GCaMP6s in astrocytes 
(using alrm-Gal4/UAS-GCaMP6s) and examined in vivo activity. We 
observed a striking positive correlation between Tdc2* neuron activity 
and somatic astrocyte Ca**: when Tdc2* neurons were active, astrocyte 
somatic Ca?* levels increased, and when Tdc2* neurons were silent, 
astrocyte somatic Ca** levels decreased (Fig. 2a and Supplementary 
Video 2). A similar correlation was observed in intact larvae (Extended 
Data Fig. 1p). Moreover, the amplitude and duration of the increase in 
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somatic astrocyte Ca”* was tightly correlated with Ca?" spikes in Tdc2t 
neurons (Fig. 2a). When we chronically silenced Tdc2* neurons by 
expressing the K* leak channel Kir2.1 (ref. 22), rhythmic oscillations 
in astrocyte Ca’* were eliminated (Fig. 2b and Extended Data Fig. 2a); 
in addition, acute optogenetic blockade of Tdc2* neuron activity using 
halorhodopsin (tdc2-Gal4 > UAS-NpHR) led to a decrease in astrocyte 
somatic Ca** (Fig. 2c and Extended Data Fig. 2b). Notably, astrocytic 
Ca? signalling was also blocked by the al adrenergic receptor antag- 
onist terazosin (Fig. 2d and Extended Data Fig. 2c), which inhibits 
arousal-evoked Ca?* responses in cortical astrocytes in mammals!*}5, 
further supporting the notion that astrocyte modulation is highly con- 
served between species. Consistent with astrocyte Ca”* signalling being 
regulated by Oct and Tyr, somatic astrocyte Ca** transients were nearly 
absent (about 80% reduced) in tdc2®°*4 mutants, which lack both Tyr 
and Oct, although they persisted in tons mutants, which lack Oct but 
retain Tyr signalling (Fig. 2e, f). Finally, Tdc2+ neurons were activated 
when olfactory neurons were optogenetically stimulated in intact 
larvae (using or83b-Gal4), suggesting that sensory cues flow to Tdc2* 
neurons, which are upstream of astrocyte Ca”* signalling (Extended 
Data Fig. 2d). 

To determine whether Tdc2* neurons signal directly to astrocytes we 
knocked down the two Tyr receptors, TyrR and TyrRII, and Oct-TyrR, 
a dual specificity receptor that can bind both Oct and Tyr?*4, in astro- 
cytes and assayed Ca*” signalling. Depletion of TyrR or TyrRII had no 
effect (data not shown), but depletion of Oct-TyrR strongly suppressed 
astrocyte somatic Ca** transients. We observed strong inhibition of 
astrocyte somatic Ca* transients in the Oct-TyrR"”° homozygous 
mutant, and in Oct-TyrR'"°/+ heterozygous mutants, which were 
previously shown to have dominant effects” (Fig. 2g). Loss of one 
copy of Oct-TyrR (Oct-TyrR"?"°/+) also led to defects in chemotaxis 
behaviour similar to those observed after astrocyte-specific knockdown 
of wtrw; these behavioural changes were enhanced in the Oct-TyrR'"° 
homozygous mutant, or by astrocyte-specific knockdown of Oct-TyrR 
with two copies of an Oct-TyrR RNAi construct. Co-expression of 
Oct-TyrR®N“' and wtrw®N“! did not enhance chemotaxis defects, sug- 
gesting that Oct-TyrR and Wtrw act in a common genetic pathway 
(Fig. 2h). Finally, astrocyte-specific depletion of Oct-TyrR led to strong 
inhibition of type II escape responses in the gentle anterior touch assay 
(Fig. 2i). We conclude that Wtrw and Oct-TyrR are required in astro- 
cytes to trigger somatic Ca” transients in response to Oct and Tyr, and 
astrocytic Wtrw and Oct-TyrR are essential for normal olfactory-driven 
chemotaxis and startle-induced escape responses. 

Astrocytes support neuronal function in many ways and it is plausible 
that alterations in astrocyte Ca”* signalling might modify upstream 
Tdc2* neuron firing to influence behaviour. We therefore measured the 
frequency and amplitude of firing events in Tdc2* neurons in wtrw™* 
null mutants and found no alterations compared to wild-type controls 
(Extended Data Fig. 2e). That Tdc2* neurons fire normally in wtrw 
mutants indicates that blocking astrocytic Ca’* signalling does not lead 
to a global disruption of neuronal activity in the larval CNS. 

Biogenic amines often regulate dopaminergic neurons”. Triple label- 
ling revealed that astrocytes, Tdc2* neurons and dopaminergic neurons 
are in close proximity in the larval CNS (Extended Data Fig. 2f). 
When we imaged dopaminergic neuron activity (using th-Gal4/UAS- 
GCaMP¢6s) we found that wtrw™* mutants exhibited a significant 
increase (about 50-70%; P < 0.01 for tl, P< 0.05 for t2 and t3) in the 
frequency of dopaminergic Ca”* transients in multiple segmental 
clusters of tyrosine hydroxylase (TH)* cells. A similar phenotype was 
observed in Oct-TyrR'"*’* mutants (Fig. 3a), although there were no 
significant changes in the average amplitude of dopaminergic neuron 
responses (Extended Data Fig. 2g). Thus Wtrw and Oct-TyrR nega- 
tively regulate dopaminergic neuron firing. Consistent with this obser- 
vation, bath application of Oct or Tyr, but not vehicle (DMSO), to the 
larval CNS suppressed dopaminergic neuron activity within seconds 
(Fig. 3b). Strikingly, depletion of the Oct-TyrR and Wtrw specifically 
from astrocytes using RNAi potently suppressed Tyr-induced Ca?* 
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Figure 2 | Tdc2* neurons fire rhythmically to drive astrocyte somatic 
Ca** transients through the Oct-TyrR. a, Correlation analyses of 
neuronal activity and somatic Ca** transients in astrocytes. Averaged 
traces of 16 individual astrocytes (GCaMP6s) and 4 pairs of Tdc2* 
neurites (R-GECO1) from the same sample. Vertical orange bars highlight 
concomitant activity in astrocytes and neurons (fevents shown in inset). 
Amplitude and duration of individual Ca?* transients in astrocytes 
correlate highly with activity of Tdc2* neurons. b, Frequency of Ca?* 
transients in astrocytes (n= 10, 160 cells total). c, d, Relative changes in 
numbers of Ca? transients after acute blockade of Oct and Tyr signalling 


elevation in astrocytes and silencing of dopaminergic neurons (Fig. 3c 
and Extended Data Fig. 2k). These data indicate that Tyr signals 
through astrocyte Oct-TyrR and Wtrw to inhibit downstream dopa- 
minergic neuron activity. 

To investigate how modulation of dopaminergic neurons regulates 
larval chemotaxis behaviour, we performed a series of acute manipu- 
lations. We first expressed the temperature-sensitive channel TrpA1 
in dopaminergic neurons; after 24-h ectopic activation of dopamin- 
ergic neurons, larvae exhibited profound chemotaxis defects similar 
to those observed when Wtrw or Oct-TyrR were depleted from astro- 
cytes (Extended Data Fig. 2h). In addition, feeding larvae a dopamine 
D1-like receptor antagonist rescued deficits associated with wtrw®NAi 
in astrocytes in chemotaxis assays (Extended Data Fig. 2i). These data 
support the notion that the increased dopaminergic neuron activity 
observed after depletion of astrocytic Oct-TyrR and Wtrw is responsi- 
ble for defects in chemotaxis behaviour. 

How do astrocytes regulate dopaminergic neuron firing? Astrocytic 
release of ATP and signalling through purinergic receptors is one mech- 
anism by which astrocytes appear to modulate neuronal activity?””®. 
The Drosophila genome contains a single seven-transmembrane- 
domain purinergic receptor, AdoR, that is most similar to mammalian 
adenosine receptors”’. Pretreatment of larval CNS preparations with 
the AdoR antagonist SCH-442416 blocked Tyr-induced silencing of 
dopaminergic neurons without altering overall dopaminergic neuron 


by halorhodopsin (c) or terazosin (d) (n= 6, 96 cells total). Black dots 
denote points at which either 561 nm light or terazosin was administered. 
e, Representative traces of astrocytic Ca** transients in mutants defective 
in Oct and Tyr signalling. f, g, Frequency of Ca”* transients in tdc2**4, 
tGh"™!8 (f, n=6, 96 cells total) and Oct-TyrR"”’ mutants (g, 1 listed for 
each genotype, 16 cells randomly selected and analysed for each sample). 
h, Chemotaxis assay (n = 12). i, Gentle touch assay (n = 30). *P < 0.05, 
**P < 0.01; NS, not significant; error bars, s.e.m. c, d, Paired t-test; 

b, f-i, one-way ANOVA followed by Tukey’s post hoc test. 


activity before Tyr treatment. Similar results were observed in AdoR 
null mutants (Fig. 4a and Extended Data Fig. 2j). Purinergic signalling 
often results from vesicular release of ATP, which can be hydrolysed to 
adenosine by ectoenzymes. We manipulated exocytosis selectively in 
astrocytes using a dominant-negative version of the dynamin Shibire 
(ShiPX) and found that this suppressed the ability of Tyr to silence 
dopaminergic neurons. This blockade was not enhanced further by 
application of AdoR antagonists and had no effect on baseline dopa- 
minergic neuron activity before Tyr treatment (Fig. 4b and Extended 
Data Fig. 21). Finally, to determine whether astrocyte Ca?" signalling 
through TRP channels is sufficient to suppress dopaminergic neuron 
firing, we expressed the wasabi (AITC)-sensitive Drosophila TrpA1 
channel in astrocytes and tested the effects of AITC application on 
astrocytic Ca** signalling and dopaminergic neuron firing. Exposure 
of control preparations to AITC did not alter astrocyte Ca”* levels, 
but application of AITC to preparations expressing TrpA1 in astro- 
cytes led to an increase in astrocytic GCaMP6s signals of around 20% 
(Extended Data Fig. 2m), similar to that elicited by Tyr application 
(Fig. 1d). Moreover, activation of astrocyte Ca** signalling by AITC 
resulted in strong suppression of dopaminergic neuron activity. This 
effect was reversed by application of AdoR antagonists and did not occur 
when AITC was applied to control (that is, non-TrpA1 expressing) 
preparations (Fig. 4c and Extended Data Fig. 2n). Thus TrpA1 channel- 
mediated increases in astrocytic Ca”* signalling are sufficient to silence 
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Figure 3 | Astrocytes mediate Tyr-induced inhibition of dopaminergic 
neurons through Oct-TyrR. a, Enhanced activity of dopaminergic 
neurons in wtrw* mutant and Oct-TyrR'""’* heterozygote larvae (n = 10; 
80 neurites). s, subesophageal segments; t, thoracic segments. b, Tyr and 
Oct (2.5 mM) inhibit the activity of dopaminergic neurons (n= 6; 

48 neurites). c, Astrocyte-specific RNAi for Oct-TyrR or wtrw attenuates 
inhibition of dopaminergic neurons by Tyr (1 = 6; 48 neurites). Black 
dots, points at which Tyr was perfused. *P < 0.05, **P< 0.01; NS, not 
significant; error bars, s.e.m. b, c, Paired t-test (b, same treatment; 

c, same genotype); a—c, one-way ANOVA followed by Tukey’s post hoc test. 


dopaminergic neurons in vivo. Silencing of dopaminergic neurons is 
likely to be mediated by release of ATP from astrocytes and subsequent 
activation of AdoR on dopaminergic neurons. 

It has been assumed that neuromodulators alter neural circuit activity 
and behaviour by acting directly on neurons; however, astrocytes 
(and other glia) express a diverse array of neuromodulatory receptors, 
suggesting they may contribute to the global effects of these neuro- 
transmitters. In this study we have demonstrated that neuromodu- 
latory signalling in at least some cases flows through astrocytes. We 
have shown that Drosophila astrocytes exhibit robust Ca’~ signalling 
events that are remarkably similar to those observed in awake behaving 
mice!*!®, Similar to the global activation of astrocyte Ca”* signalling 
by norepinephrine in mammals, release of Tyr (and probably Oct) 
in vivo triggers synchronous activation of astrocytes. We have identified 
Wtrw and Oct-TyrR as key components of the astrocyte Ca”* signal- 
ling axis, and their loss-of-function phenotypes have provided direct 
evidence that astrocytic Ca** signalling modulates animal behaviour. 
Furthermore, we have shown that, during suppression of dopamin- 
ergic neuron activity, Oct and Tyr signal directly to astrocytes, and 
astrocyte Ca’** transients execute neuromodulatory events downstream 
(Fig. 4d). Oct and Tyr have been linked to arousal and aggression in 
insects and are considered to be functionally equivalent to norepi- 
nephrine in mammals. The profound modulation of astrocytic Ca?* 
levels by norepinephrine in mammals'*'° suggests that activation of 
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Figure 4 | Tyr-mediated inhibition of dopaminergic neurons depends 
on adenosine receptors and glial endocytic function. a, AdoR is 

required for Tyr-mediated inhibition of dopaminergic neurons (n = 6; 

48 neurites). b, Blockade of endocytic function by dominant-negative 
shibire (shi?) attenuates inhibition of dopaminergic neurons by Tyr (n= 6; 
48 neurites). Black dots, points at which Tyr was perfused. c, Ca”* influx 

to astrocytes through TrpA1 is sufficient to inhibit dopaminergic neurons 
(n=6; 48 neurites). Black dots, points at which AITC was perfused. 

d, Model. Olfactory and (probably) mechanosensory information flow 
towards Tdc2* neurons, which release Oct and Tyr to activate Oct-TyrR 
on astrocytes and in turn induce astrocytic Ca”* entry through the 

TRP channel Wtrw. The increase in astrocytic Ca** is sufficient to 

silence dopaminergic (DA) neurons through a mechanism requiring the 
adenosine receptor AdoR, potentially through astrocyte ATP release and 
its breakdown to adenosine. Inhibition of dopaminergic neuron activity by 
astrocytes is essential for normal chemotaxis and startle-induced reversal. 
*P<0.05, **P<0.01; NS, not significant; error bars, s.e.m. a—c, Paired 
t-test (same genotype); Wilcoxon and Mann-Whitney tests followed 

by Bonferroni-Holm post hoc test (a, b); one-way ANOVA followed by 
Tukey’s post hoc test (c). 
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astrocytic Ca** signalling by Tyr, Oct or norepinephrine, and in 
turn astrocyte-based neuromodulation, are ancient features of the 
metazoan nervous system. A reassessment of the cellular basis of 
neuromodulation is therefore warranted, with further reflection on 
direct roles for astrocytes in transducing neuromodulatory signals 
in neural circuits. 


Online Content Methods, along with any additional Extended Data display items and 
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METHODS 

Fly stocks and husbandry. Flies were cultured in standard cornmeal food 
at 25°C in 12h-12h light-dark cycles. Fly stocks include: w8\ alrm-Gal4, 
alrm-LexA::GAD, UAS-wtrw-gfp, LexAop-R-GECO1 (Bloomington, 52224), 
tdc2-Gal4 (Bloomington, 9313), 20XUAS-IVS-GCaMPé6s (Bloomington, 
42746), 13XLexAop2-IVS-GCaMP6s-p10 (Bloomington, 44274), 
UAS-Kir2.1-EGFP (Bloomington, 6595), UAS-Oct-TyrR®S“! (Bloomington, 28332), 
th-Gal4 (Bloomington, 8848), UAS-wtrwYP®° (VDRG, 107423), Oct-TyrR™"” (Kyoto, 
109038), UAS-shiP (Bloomington, 5822), UAS-TrpA1 (Bloomington, 26263), 
UAS-NpHR (Kyoto, 117008), UAS-CsChrimson (Bloomington, 55135), 
AdoR*°"64« (Bloomington, 30868), tdc2-LexA::p65 (ref. 26), UAS-wirw? (ref. 13), 
wtrw® (ref. 30), tGh"™!8 (ref. 31), tdc2®O™4 (ref. 32), th-LexA::p65 (ref. 33), 
tsh-Gal80 (ref. 34). 

Chemotaxis and light avoidance. These two assays were performed using proto- 
cols described previously with minor modifications*. Briefly, pools of ~100 3rd 
instar larvae (108-120h after egg lay) were allowed to move freely for 5 min on Petri 
dishes with according settings for chemotaxis assay or phototaxis assay, then the 
numbers of larvae in odorant versus vehicle circles or in light versus dark quadrants 
were scored, respectively. Response indices were calculated as: RI chemotaxis = (Nodor — 
Nyehicle)/(Nodor + Nvehicle); Rlight avoidance = (Naark — Muight)/(Naark + Night). IAA was 
diluted to 1:5,000 in mineral oil, and pure mineral oil was used as the control. Light 
intensity in phototaxis assays was 2,400 lx white light (5,400 K). Thermogenetic 
manipulation of dopaminergic neurons was accomplished by transferring 3rd 
instar larvae from 18°C to 25°C to allow activation of TrpA1 for 24h before testing. 
For chemotaxis assays related to D1-like or D2-like dopamine receptor inhibitors, 
3rd instar larvae were fed on cornmeal food mixed with inhibitors for 24h before 
testing. 

Gentle touch assay. We struck early 3rd instar larvae (84-96 h after egg lay) in 
the thoracic segments with a hair while they were moving straight. No response, a 
stop, head retraction and turn were grouped into type I responses, and initiation 
of at least one single full body retraction or multiple full body retractions were 
categorized as type II reversal responses. 

Larval motility. Third instar larvae (108—120h after egg lay) were videotaped 
and the maximal velocity was calculated by dividing the travel distance measured 
when larvae moved straight without pause by the time taken. 

Molecular biology and transgenic flies. pUAST-wtrw::gfp was made by cloning 
full length wtrw tagged with gfp at its C-terminal into pUAST. Transgenic flies were 
obtained by standard germline injection (BestGene Inc.). 

Chemicals, buffers and antibodies. For live preparation studies, all chemicals 
were diluted in live imaging buffer (110 mM NaCl, 5.4mM KCl, 1.2mM CaCh, 
0.8mM MgCh, 10mM p-glucose, 10mM HEPES, pH adjusted to 7.2), including 
1M tetrodotoxin (TTX), 0.1mM lanthanum chloride (LaCl;), 2.5mM Oct, 
2.5mM glutamate, 2.5mM acetylcholine, 2.5 mM GABA, 150\1M AITC, 10.M 
SCH-442416, 1 mM terazosin and 0.5mM Tyr unless otherwise stated. The con- 
centration of SCH-23390 (dopamine D1-like receptor antagonist) was 0.2 mg/ml; 
eticlopride (dopamine D2-like receptor antagonist) was 0.2 mg/ml; all-trans retinal 
was 10mM. Rabbit anti-GAT antibody was used at 1:5,000 and mouse anti-nc82 
antibody was used at 1:50 (DSHB). 

Live imaging and data analysis. First instar larvae were immersed in halocarbon 
oil 27 and immobilized between coverslips and slides to image the activity of neu- 
rons and astrocytes through the cuticle in intact larvae. Third instar wandering 
stage larvae were used to dissect the CNS. Briefly, intact CNSs were transferred to 
a silicone filled dish after dissection and incubated with 1001] live imaging buffer. 
Samples were allowed to sit at room temperature for 15 min before imaging. To 
examine the effects of LaCl; or tetrodotoxin on calcium transients, samples were 


pre-incubated in buffer containing chemicals for 15 min before imaging. Movies 
were acquired by using Volocity software on a spinning disc microscope equipped 
with a 20x (dissected CNS) or 40x (intact larvae) water immersion objective at 
2 frames per s. Changes in GCaMP intensity of ROIs were calculated and expressed 
as (Fy... — F)/F (first 10 frames were averaged as F), which then were plotted over 
time. Plotted traces were semi-automatically analysed by the multiple peak fitting 
function of Igor Pro (WaveMetrics Inc.) to obtain the numbers and amplitudes 
of peaks. 

Bath application of drugs. Samples were prepared as described above and incu- 
bated with imaging buffer (with/without 1 |1M TTX). At ~240s (astrocytes) or 
~300s (neurons) after imaging, 1001] live imaging buffer containing 2x final 
concentrations of chemicals (with/without 11M TTX) were directly added to the 
medium, and imaging continued. The GCaMP6s intensity of ROIs was plotted 
and analysed as mentioned above. For terazosin experiments we first examined 
preparations for 6 min (control, pre-exposure window). Then terazosin was per- 
fused for 3 min, and we then imaged for an additional 6 min (exposed window). 
Data points of 6-9 min that represented the perfusion window were omitted 
from the graph. 

Optogenetic activation and inactivation of neurons. For CsChrimson acti- 
vation of olfactory neurons, larvae were grown on cornmeal food daily topped 
with 1001] all-trans retinal for 3 days. Early 3rd instar larvae were immobilized 
between coverslips and slides in halocarbon oil 27 and the activity of Tdc2* 
neurons reflected by GCaMP6s was continuously monitored except when red light 
(640 nm, 3.41 mW/mm’) was delivered through an epifluorescence light source. 
Halorhodopsin-mediated inhibition of Tdc2* neurons was accomplished by grow- 
ing larvae on cornmeal food topped daily with 100 1l all-trans retinal for 4-5 days. 
Larval brains were dissected from wandering 3rd instar larvae and prepared as 
described above for imaging somatic calcium transients in astrocytes. GCaMP6s 
signals were recorded for 6 min, then halorhodopsin was activated (within ~30s) 
by alternating delivery of 561 nm (0.26 mW/mm”, 3s, for photoinhibition) and 
blue light 488 nm (500 ms, for imaging GCaMP6s-labelled astrocytes) for another 
6 min. Data points from two consecutive movies were then assembled together to 
show changes in somatic calcium transients. 

Statistical analysis. No statistical methods were used to predetermine sample 
size. All n numbers represent biological replicates. Data were pooled from 2-3 
independent experiments. The experiments were not randomized or blinded. The 
Shapiro-Wilk test (normal distribution if P > 0.05) was used to determine the 
normality of data. Statistical comparisons were performed using one-way 
ANOVA followed by Tukey’s post hoc test or paired Student's t-test (two-tailed). 
Non-normally distributed data were compared using Wilcoxon and Mann- 
Whitney tests followed by Bonferroni-Holm post hoc test. P< 0.05 was considered 
significant. All the data in bar graphs are expressed as mean + s.e.m. 
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Extended Data Figure 1 | Synchronous somatic Ca’* transients in 
Drosophila astrocytes. a-c, Brief diagrams of behavioural tests. d, tsh-Gal80 
suppression of alrm-Gal4 activity. Nc82, neuropil (NP). Astrocytes (Astro), 
alrm > myr::tdTomato. Vnc, ventral nerve cord. Scale bar, 50m. 

e, f, Chemotaxis assay (n = 12). g, Locomotion assay (n listed). h, Light 
avoidance assay (n= 12). i, Gentle touch assay (n = 24). j, GCaMP6s and 
mCherry expression in astrocytes. Scale bar, 50 1m. k, Representative 
pseudocoloured images of four continuous Ca** transients. Scale bar, 
50m. I, Traces of normalized GCaMP6s intensity over mCherry of 

10 individual astrocytes in 15-min live imaging windows. m, Averaged 
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traces of individual somatic Ca?* transients. n, Somatic Ca** transients 

in astrocytes treated with TTX and LaCl; (n= 10, 160 cells). 0, GCaMP6s 
expression in astrocytes and traces of 10 individual astrocytes from an 
intact larva. Scale bar, 20\1m. Grey bars (1, 0) represent population rise or 
fall in GCaMP6s signals. p, GCaMP6s-labelled astrocytes and R-GECO1- 
labelled Tdc2* neurites and their averaged traces in an intact larva. Scale 
bar, 20m. *P < 0.05, **P < 0.01; NS, not significant; error bars, s.e.m. 
Wilcoxon and Mann-Whitney tests followed by Bonferroni-Holm post hoc 
test (e, n), one-way ANOVA followed by Tukey’s post hoc test (f-i). 
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Extended Data Figure 2 | Somatic Ca”* transients in astrocytes inhibit 
the activity of dopaminergic neurons. a—c, Representative traces of 
astrocyte Ca”* transients with blockade of Tyr and Oct signalling. 

d, Stimulation of olfactory neurons activates Tdc2* neurons (n= 3-4). 
Scale bar, 25 um. e, Activity of Tdc2* neurons is not altered in wtrw 
mutants (1 = 8, 48 neurites). f, Astrocytes, Tdc2* neurons and 
dopaminergic neurons in larval CNS. Dorsal (arrows point to astrocyte 
somas labelled with anti-GAT antibody), medial (neurites intermingled 
with ramified processes of astrocytes for monitoring activity are 
labelled) and ventral (cell bodies of tdc2>myr::tdTomato, Gal4/UAS 
and th>GCaMP6s LexA/LexAop dopaminergic neurons) images from 


the boxed region are shown (right). s, subesophageal. t, thoracic. Scale 

bar, 50jum. g, Amplitude of Ca?" spikes in dopaminergic neurons (n= 10, 
80 neurites). h, i, Chemotaxis assay (n listed). j, Number of Ca?* spikes of 
dopaminergic neurons (n =6, 48 neurites). k, Responses of astrocytes 

to Tyr (0.5 mM) in the presence of TTX (n= 6, 96 cells total). 1, Number 
of Ca?* spikes in dopaminergic neurons (n = 6, 48 neurites). m, AITC 
induces Ca** influx into astrocytes expressing TrpA1 (n =5, 80 cells). 
Scale bar, 501m. n, Number of Ca?* spikes in dopaminergic neurons 
(n=6, 48 neurites). *P < 0.05, **P < 0.01; NS, not significant; error bars, 
s.e.m. One-way ANOVA followed by Tukey’s post hoc test. 
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Transcription of the non-coding RNA upperhand 
controls Hand2 expression and heart development 


Kelly M. Anderson!?, Douglas M. Anderson!?, John R. McAnally!”, John M. Shelton?, Rhonda Bassel-Duby!” & Eric N. Olson!” 


HAND2? is an ancestral regulator of heart development and one 
of four transcription factors that control the reprogramming of 
fibroblasts into cardiomyocytes'*. Deletion of Hand2 in mice 
results in right ventricle hypoplasia and embryonic lethality’. 
Hand2 expression is tightly regulated by upstream enhancers” 
that reside within a super-enhancer delineated by histone H3 acetyl 
Lys27 (H3K27ac) modifications®. Here we show that transcription 
of a Hand2-associated long non-coding RNA, which we named 
upperhand (Uph), is required to maintain the super-enhancer 
signature and elongation of RNA polymerase II through the Hand2 
enhancer locus. Blockade of Uph transcription, but not knockdown 
of the mature transcript, abolished Hand2 expression, causing right 
ventricular hypoplasia and embryonic lethality in mice. Given 
the substantial number of uncharacterized promoter-associated 
long non-coding RNAs encoded by the mammalian genome’, the 
Uph-Hand2 regulatory partnership offers a mechanism by which 
divergent non-coding transcription can establish a permissive 
chromatin environment. 

In all well-annotated mammalian species, long non-coding RNA 
(IncRNA) transcripts with a similar intron-exon organization are 
transcribed upstream of the Hand2 locus (Extended Data Fig. 1a, b). 
We refer to these transcripts as ‘upperhand’ (Uph), although the human 
transcripts have been referred to as DEIN (differentially expressed in 
neuroblastoma, also known as HAND2-AS1)!°. Rapid amplification 
of cDNA ends (RACE) analysis of Uph revealed several alternatively 
spliced transcripts in the mouse heart (Extended Data Fig. 1c). The 
major mouse Uph transcript is 770 nucleotides long, contains six 
exons, a polyadenylation sequence and shares a bidirectional promoter 
with Hand2 (ref. 10; Fig. 1a). The mouse Uph locus encompasses 
~16.5 kilobases and contains two enhancers that direct Hand2 expres- 
sion in the heart and branchial arches®’, as well as histone acetylation 
marks that delineate a cardiac super-enhancer in this locus® (Fig. 1a). 

Similar to most characterized IncRNAs!', the Uph nucleotide 
sequence is not well-conserved, with only ~56% homology between 
mouse and human, and little conservation across other species 
(Extended Data Fig. 1a). However, Uph orthologues share a promoter 
and contain the conserved Hand2-associated cardiac and brachial arch 
enhancers within their second introns (Extended Data Fig. 1b). 

Promoter-associated IncRNAs are often positively correlated with 
transcription of their protein-coding neighbour!*'’. Indeed, whole- 
mount in situ hybridization for Uph in mouse embryos revealed 
expression in the heart, distal branchial arches and the developing 
limb bud at embryonic day (E) 10.5, overlapping with Hand2 expression 
(Extended Data Fig. 1d). Uph expression was strong in the heart and 
continued to mirror Hand2 at later fetal stages (Fig. 1b). Northern blot 
analysis across several adult tissues revealed that Uph transcript levels 
were highest in the heart (Extended Data Fig. le). 

Cell fractionation of mouse neonatal cardiomyocytes showed Uph 
transcripts in all fractions but enrichment in the cytoplasm, similar 
to 18S RNA (Extended Data Fig. 2a). The distribution of Uph differed 


from nuclear IncRNAs such as Malat1, which associates with chromatin 
(Extended Data Fig. 2a). Previously, we reported that some annotated 
IncRNAs can encode micropeptides'*'°. However, Uph does not 
contain a conserved open reading frame and in vitro transcription and 
translation of Uph did not produce any detectable peptides (Extended 
Data Fig. 2b). Together, these data reveal that Uph is a bona fide IncRNA 
that is co-expressed in a temporal and tissue-specific pattern with the 
essential cardiac transcription factor Hand2. 

To examine Uph function in vivo, we used transcription activator-like 
effector nucleases (TALENs) to insert a triple polyadenylation sequence 
into exon 2 of the Uph locus in C57BL/6 mouse zygotes (Extended Data 
Fig. 3a). To avoid potential regulatory elements required for Hand2 
expression, we inserted the triple polyadenylation cassette into a region 
with low sequence conservation. Furthermore, we generated a separate 
knock-in allele using a heterologous DNA sequence (tdTomato (tdTO) 
lacking a polyadenylation sequence) inserted into the same Uph locus 
(Extended Data Fig. 3b). Correct targeting of the triple polyadenylation 
and tdTO sequences was verified by Southern blot analysis (Extended 
Data Fig. 3c, d). 

Mice homozygous for the Uph tdTO allele (Uph'4?0/'41°) were born 
at expected Mendelian ratios and showed no morphological defects or 
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Figure 1 | Co-expression of upperhand (Uph) and Hand2. a, The mouse 
Uph-Hand2 locus. Uph is transcribed 150 bp upstream of Hand2 and 
contains two Hand2 enhancers (branchial arch enhancer (green) and 
cardiac enhancer (yellow)) within one of its introns. Bottom, mammalian 
sequence conservation (blue). b, Section in situ hybridization of E15.5 
mouse embryos for either Uph (left) or Hand2 (right). Expression was 
detected in heart (h), tongue (t) and jaw (j). Scale bars, 1 mm. 
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b Rostral-to-caudal sections 


Uph WT 


Uph KO 


Figure 2 | Cardiac defects of Uph KO embryos. a, At E10.5, Uph KO 
embryos have pericardial effusion (pe) and appear smaller than wild- 

type (WT) littermate embryos. b, At E10.5, wild-type embryos (top) 

have distinct right and left ventricular (rv and lv, respectively) chambers, 
whereas Uph KO embryos (bottom) have a single ventricle (v). This 
abnormality is highlighted in rostral-to-caudal sections through the heart 
(left to right). a, atria; nt, neural tube; oft, outflow tract. Scale bars, 500 jum 
(a) and 100m (b). 


notable changes in Uph or Hand2 expression (Extended Data Fig. 3e, f). 
Thus, insertion of a heterologous DNA sequence into exon 2 of the Uph 
locus is not sufficient to disrupt normal Hand2 expression. However, 
from 31 pups born of heterozygous Uph (Uph*') intercrosses, no Uph- 
knockout (KO) offspring survived, suggesting that Uph transcription 
is required for embryonic survival. 

To pinpoint the embryonic death of Uph KO embryos, we collected 
embryos from timed pregnancies of Uph*/~ matings. Uph KO embryos 
in a pure C57BL/6 background were present at approximate Mendelian 
ratios before E9.5 (20.6%; 13 out of 63 embryos collected), and showed 
no obvious size or morphological differences relative to wild-type litter- 
mates. However, by E10.5, the Uph KO embryos had a large pericardial 
effusion surrounding the heart and were growth restricted (Fig. 2a). 
Quantitative real-time PCR (qPCR) showed that Uph transcripts were 
reduced by 97% in Uph KO compared to wild-type hearts at E10.5, 
confirming successful termination of Uph transcription (Extended 
Data Fig. 3g). Histological analyses revealed that hearts from Uph 
KO embryos failed to develop a right ventricular chamber. Instead, 
the outflow tract and atrial chamber connected to a single left-sided 
ventricular chamber (Fig. 2b), a cardiac phenotype reminiscent of 
Hand2 KO embryos. 

Because Uph KO embryos displayed a similar cardiac phenotype 
to Hand2 KO embryos, we analysed the expression of Hand2 and 
other cardiac transcription factors in Uph KO embryos using 


@ —_£9.5 transverse 


E9.5 sagittal 


E10.5 transverse 


Uph WT 


Figure 3 | Loss of cardiac Hand2 expression in Uph KO embryos. 
a, In situ hybridizations of wild-type and Uph KO embryos at E9.5 
(transverse and sagittal sections) and E10.5 (transverse) show loss of 
Hand2 expression in the heart of Uph KO embryos, with detectable 
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in situ hybridization. Notably, Hand2 expression was absent in the 
atria, ventricle and outflow tract of Uph KO embryos, but remained 
detectable, albeit at reduced levels, in the branchial arches and limb 
buds (Fig. 3a, Extended Data Fig. 4a). Consistent with the persistent 
expression of Hand2 in the branchial arches, Uph KO embryos lacked 
the vascular defects in the aortic arch arteries and the aortic sac 
typically seen in Hand2 mutant embryos! (Extended Data Fig. 4b). 
Genes encoding other cardiac transcription factors essential for cardiac 
morphogenesis, namely Nkx2-5, Gata4 and the related Hand1, were 
not considerably changed in Uph KO embryos (Fig. 3b, Extended Data 
Fig. 4a), demonstrating the specificity of Uph transcription in regulating 
Hand2 expression. 

To determine whether Uph functions in cis or trans to regulate 
Hand2, we intercrossed mice carrying a Hand2 KO allele with mice 
carrying the Uph KO allele, to generate compound heterozygous 
(Uph*!~ Hand2*'~) embryos. Notably, Uph*!~ Hand2*!~ embryos 
recapitulated the Uph KO phenotype, displaying pericardial effusion 
and embryonic lethality by E10.5 (Fig. 4a, Extended Data Fig. 5a). 
While Uph expression was not reduced in compound heterozygotes, 
Hand2 expression was decreased by greater than 90% (Extended Data 
Fig. 5b). These findings demonstrate that expression of Hand2 in the 
heart is dependent on transcription of Uph in cis, which cannot be 
compensated for by Uph transcripts generated on the sister allele. 

Several IncRNAs have been proposed to function by recruiting the 
trithorax chromatin-modifying complex to gene loci’®!’. To determine 
whether Uph interacts with the trithorax complex, we performed RNA 
immunoprecipitation using an antibody against the RNA-binding 
adaptor of the MLL-1 trithorax protein, WDR5. While the ncRNA 
HOTTIP formed a stable complex with WDRS in the mouse HL-1 
cardiomyocyte-like cell line'®’, we could not detect an interaction 
between MLL-1 and Uph (Extended Data Fig. 5c). 

To determine whether the mature Uph RNA transcript is required for 
Hand2 expression, we performed knockdown of Uph using GapmeR 
antisense oligonucleotides’? in HL-1 cells and in the neuroblastoma 
cell line Neuro2a, both of which express Uph and Hand2 (Extended 
Data Fig. 6a). Knockdown of mature Uph transcripts by more than 
90% using two different GapmeR oligonucleotides in both cell lines did 
not alter Hand2 expression (Extended Data Fig. 6b, c). Because Uph is 
alternatively spliced, we performed qPCR across all Uph exon junctions 
(Extended Data Fig. 6d). We further verified that efficient knockdown 
was attained in the nuclear fraction of HL-1 cells (Extended Data 
Fig. 6e). Additionally, overexpression of the full-length Uph transcript 
in HL-1 cells using transient transfection had no effect on the levels of 
Hand2 mRNA (Extended Data Fig. 6f). Together, these data suggest 
that, in contrast to premature transcriptional termination of Uph 
in vivo, the mature Uph transcript is not required for Hand2 expression. 


b E9.5 sagittal E9.5 transverse 


E9.5 sagittal 


Uph WT 


Uph KO 


expression in the branchial arches (ba) and limb bud (Ib). b, In situ 
hybridizations of wild-type and Uph KO embryos at E9.5 (transverse 
and sagittal sections) using Hand1, Gata4 or Nkx2-5 probes. Scale bars, 
100 1m. 
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Figure 4 | Uph transcription in cis is required for active enhancer marks 
in the Uph locus and RNAPII elongation at the Hand2 gene. a, E10.5 
embryos heterozygous for either Uph (top) or Hand2 (middle) appear 
normal in size and morphology. By contrast, Uph*’~ Hand2*’~ double 
heterozygote embryos developed a single ventricle. Scale bars, 1 mm. 
b, Uph-Hand2 locus with letters (A or B) indicating GapmeR target sites, 
and numbers (1-3) indicating the regions analysed by qPCR following 
ChIP. The Hand2 branchial arch enhancer (green) and cardiac enhancer 
(yellow) are indicated. Shown in red are the ENCODE/LICR ChIP 
followed be sequencing (ChIP-segq) tracks for H3K4mel and H3K27ac 
active enhancer markers. E1-E6, exons 1-6. See Methods for source 
data. c, d, H3K4mel (c) and H3K27ac (d) modifications present at the 
Uph locus in E10.0 wild-type hearts are reduced in Uph KO hearts, and 
unchanged in Uph'4?°/¢T° homozygous hearts. Values are normalized to 
total histone H3. e, GATA4 binding to the cardiac enhancer is reduced 
in E10.0 knockout hearts relative to wild type. f, ChIP using RNAPII 
(phosphoS2) antibody at the Hand2 transcriptional start site or gene 
body in wild-type, Uph KO and Uph'*19/"¢T° homozygous hearts. Values 
normalized as a percentage of total input chromatin. Each point is one of 
three technical replicates of five pooled hearts for each genotype in each 
ChIP experiment, from one of two independent experiments; data are 
mean = s.e.m. 


DNA loci with high regulatory capacity such as enhancers”*”!, 


polycomb response elements”””? or locus control regions” are often 
associated with non-coding transcription and can function to maintain 
a permissive histone signature. In the developing heart, the super- 
enhancer at the Hand2 locus is defined by H3K27ac (Fig. 4b). To 
test whether Uph transcription is required to maintain the enhancer 
signature in the Uph—Hand2 locus, we performed chromatin immu- 
noprecipitation (ChIP) on Uph KO and wild-type E10.0 hearts, using 
H3K4mel and H3K27ac active enhancer marker antibodies. We found 
that the H3K4mel and H3K27ac peaks normally established at the 
Uph-Hand2 locus were reduced in Uph KO hearts (Fig. 4c, d). 

The GATA4 transcription factor is required for activation of the 
HAND2? cardiac enhancer in the Uph locus, and participates in estab- 
lishing active H3K27ac marks””°. We observed a reduction in GATA4 
binding to the Hand2 cardiac enhancer in Uph KO hearts, a finding 
consistent with the reduced H3K27ac enhancer marks in the Uph locus 
(Fig. 4e). We observed no differences in H3K4me1 or H3K27ac levels 
at an active enhancer in the promoter region of Nkx2-5, no differences 
in GATA4 binding to the Nkx2-5 promoter, and no differences in 
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H3K27me3 repressive marks between wild-type and Uph KO embryos 
at the Uph-Hand2 locus (Extended Data Fig. 7b-f). 

Enhancer-associated histone acetylation is necessary for gene 
expression by promoting RNA polymerase II (RNAPII) elongation 
at gene promoters”®. To measure the levels of RNAPII at the Hand2 
locus, we performed ChIP on wild-type and Uph KO E10.0 hearts 
using the Ser2-phosphorylated RNAPII antibody, a marker of RNAPII 
elongation, at the Hand2 locus. Although there was no difference in 
RNAPII recruitment to the Hand2 transcriptional start site, RNAPII 
binding to the body of the Hand2 locus was decreased in Uph KO hearts 
(Fig. 4f). We found no differences in Ser2-phosphorylated RNAPII 
binding to the Nkx2-5 gene body between genotypes (Extended Data 
Fig. 7g). These findings suggest that the loss of GATA4 and histone 
acetylation marks in the Uph-Hand2 locus of Uph KO hearts prevents 
RNAPII elongation within the Hand2 locus. 

Here we define the in vivo function of Uph, a cardiac-enriched 
IncRNA co-transcribed bi-directionally with the cardiac transcription 
factor Hand2. Termination of Uph transcription, but not of the mature 
transcript, resulted in the loss of Hand2 expression in the heart and 
partially phenocopied the lethal cardiac defects observed in Hand2 
KO embryos. Compound Uph*!~ Hand2*'~ heterozygosity causes 
embryonic lethality, suggesting that transcription of Uph is required 
in cis to promote Hand2 expression. Termination of Uph transcription 
in vivo reduced GATA4 binding and the Uph-Hand2 super-enhancer 
signature, resulting in RNAPII pausing and loss of Hand2 expression 
in the heart. We predict that transcription of Uph governs Hand2 
expression in all tissues for which the expression of Hand2 depends 
on enhancers located within the Uph locus. Future studies are needed 
to determine the role of Uph in the regulation of Hand2 in non-cardiac 
tissues, which are currently precluded owing to early embryonic 
lethality of the global Uph KO. Given the importance of Hand2 in the 
development of the palate, tongue, limbs, and sympathetic nervous 
system*” °°, Uph provides a potentially important non-coding regulatory 
mechanism for the temporal and spatial control of Hand2 expression. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 


Received 4 December 2015; accepted 29 September 2016. 
Published online 26 October 2016. 


1. Srivastava, D. et al. Regulation of cardiac mesodermal and neural crest 
development by the bHLH transcription factor, (HAND. Nat. Genet. 16, 
154-160 (1997). 

2. Han, Z,, Yi, P, Li, X. & Olson, E. N. Hand, an evolutionarily conserved bDHLH 
transcription factor required for Drosophila cardiogenesis and hematopoiesis. 
Development 133, 1175-1182 (2006). 

3. Song, K. et a/. Heart repair by reprogramming non-myocytes with cardiac 
transcription factors. Nature 485, 599-604 (2012). 

4. Conway, S. J., Firulli, B. & Firulli, A. B. A bHLH code for cardiac morphogenesis. 
Pediatr. Cardiol. 31, 318-324 (2010). 

5. Yamagishi, H., Olson, E. N. & Srivastava, D. The basic helix-loop-helix 
transcription factor, dHAND, is required for vascular development. J. Clin. 
Invest. 105, 261-270 (2000). 

6. Charité, J. et a/. Role of DIx6 in regulation of an endothelin-1-dependent, 
dHAND branchial arch enhancer. Genes Dev. 15, 3039-3049 (2001). 

7. McFadden, D. G. et al. A GATA-dependent right ventricular enhancer controls 
dHAND transcription in the developing heart. Development 127, 5331-5341 
(2000). 

8. Hnisz, D. et al. Super-enhancers in the control of cell identity and disease. Cel! 
155, 934-947 (2013). 

9. Lepoivre, C. et al. Divergent transcription is associated with promoters of 
transcriptional regulators. BMC Genomics 14, 914 (2013). 

10. Voth, H. et a/. Co-regulated expression of HAND2 and DEIN by a bidirectional 
promoter with asymmetrical activity in neuroblastoma. BMC Mol. Biol. 10, 

28 (2009). 

11. Chodroff, R. A. et al. Long noncoding RNA genes: conservation of sequence 
and brain expression among diverse amniotes. Genome Biol. 11, R72 (2010). 

12. Kambara, H. et a/. Regulation of interferon-stimulated gene BST2 by a IncRNA 
transcribed from a shared bidirectional promoter. Front. Immunol. 5, 676 
(2015). 

13. Uesaka, M. et al. Bidirectional promoters are the major source of gene 
activation-associated non-coding RNAs in mammals. BMC Genomics 15, 35 
(2014). 


00 MONTH 2016 | VOL 000 | NATURE | 3 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


14. Anderson, D. M. et a/. A micropeptide encoded by a putative long noncoding 
RNA regulates muscle performance. Cell 160, 595-606 (2015). 

15. Nelson, B. R. et al. A peptide encoded by a transcript annotated as long 
noncoding RNA enhances SERCA activity in muscle. Science 351, 271-275 
(2016). 

16. Grote, P. et al. The tissue-specific IncRNA Fendrr is an essential regulator of 
heart and body wall development in the mouse. Dev. Cell 24, 206-214 (2013). 

17. Wang, K. C. et al. A long noncoding RNA maintains active chromatin to 
coordinate homeotic gene expression. Nature 472, 120-124 (2011). 

18. Yang, Y. W. et a/. Essential role of IncRNA binding for WDR5 maintenance of 
active chromatin and embryonic stem cell pluripotency. eLife 3, eE02046 
(2014). 

19. Stein, C. A. et al. Efficient gene silencing by delivery of locked nucleic acid 
antisense oligonucleotides, unassisted by transfection reagents. Nucleic Acids 
Res. 38, e3 (2010). 

20. Kaikkonen, M. U. et al. Remodeling of the enhancer landscape during 
macrophage activation is coupled to enhancer transcription. Mol. Cel/ 51, 
310-325 (2013). 

21. Kim, T. K. et al. Widespread transcription at neuronal activity-regulated 
enhancers. Nature 465, 182-187 (2010). 

22. Schmitt, S., Prestel, M. & Paro, R. Intergenic transcription through a polycomb 
group response element counteracts silencing. Genes Dev. 19, 697-708 
(2005). 

23. Herzog, V. A. et al. A strand-specific switch in noncoding transcription switches 
the function of a Polycomb/Trithorax response element. Nat. Genet. 46, 
973-981 (2014). 

24. Ho, Y., Elefant, F., Liebhaber, S. A. & Cooke, N. E. Locus control region 
transcription plays an active role in long-range gene activation. Mol. Cell 23, 
365-375 (2006). 

25. He, A. et al. Dynamic GATA4 enhancers shape the chromatin landscape central 
to heart development and disease. Nat. Commun. 5, 4907 (2014). 

26. Jonkers, |. & Lis, J. T. Getting up to speed with transcription elongation by RNA 
polymerase II. Nat. Rev. Mol. Cell Biol. 16, 167-177 (2015). 

27. Yanagisawa, H., Clouthier, D. E., Richardson, J. A., Charité, J. & Olson, E. N. 
Targeted deletion of a branchial arch-specific enhancer reveals a role of 
dHAND in craniofacial development. Development 130, 1069-1078 
(2003). 


4 | NATURE | VOL 000 | 00 MONTH 2016 


28. Barron, F. et al. Downregulation of DIx5 and DIx6 expression by Hand2 is 
essential for initiation of tongue morphogenesis. Development 138, 
2249-2259 (2011). 

29. Charité, J., McFadden, D. G. & Olson, E. N. The bHLH transcription factor dHAND 
controls Sonic hedgehog expression and establishment of the zone of polarizing 
activity during limb development. Development 127, 2461-2470 (2000). 

30. Lucas, M. E., Muller, F., Ridiger, R., Henion, P. D. & Rohrer, H. The bHLH 
transcription factor hand2 is essential for noradrenergic differentiation of 
sympathetic neurons. Development 133, 4015-4024 (2006). 


Acknowledgements We thank D. Tennison for technical assistance and 

J. Cabrera for graphics. We thank the Histopathology Core Facility at University 
of Texas Southwestern for histology. This work was supported by grants from 
the National Institutes of Health (HL-O77439, AR-067294, HL-130253, 
DK-099653 and UO1-HL-100401), Fondation Leducq Networks of Excellence, 
Cancer Prevention and Research Institute of Texas and the Robert A. Welch 
Foundation (grant 1-0025 to E.N.O.), a pre-doctoral fellowship from the 
American Heart Association (14PRE19830031) to K.M.A. and a Muscular 
Dystrophy Association Development Grant (MDA377340) to D.M.A. 


Author Contributions K.M.A., D.M.A. and E.N.O. designed experiments and 
analysed data. J.R.M. generated mutant mice from constructs designed and 
created by K.M.A. and D.M.A. K.M.A. performed qPCR, phenotypic analysis, 
dissections, ChIP, RNA immunoprecipitation, cardiomyocyte fractionation, 
Southern blot, and India ink injections. D.M.A performed northern blot, RACE, 
in vitro transcription/translation, and GapmeR transfections. J.M.S. and K.M.A. 
performed in situ hybridization and generated images. K.M.A., D.M.A, E.N.O. and 
R.B.D. wrote and edited the manuscript. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial 
interests. Readers are welcome to comment on the online version of the paper. 
Correspondence and requests for materials should be addressed to 

E.N.O. (eric.olson@utsouthwestern.edu). 


Reviewer Information Nature thanks H. Chang and the other anonymous 
reviewer(s) for their contribution to the peer review of this work. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


METHODS 


Animal models. All experimental procedures involving animals in this study 
were reviewed and approved by the University of Texas Southwestern Medical 
Center's Institutional Animal Care and Use Committee. Animals/embryos were 
allocated to experimental groups based on genotype and we did not use exclusion, 
randomization or blinding approaches. The sex of embryos used in these studies 
was not determined. In general, sample size was chosen to use the least number 
of animals/embryos to achieve statistical significance and no statistical methods 
were used to predetermine sample size. 

In situ hybridization. Whole mount in situ hybridization was performed using 
digoxigenin-labelled antisense RNA probes specific to Uph and Hand2, using 
methods previously described*!. Radioisotopic in situ hybridization studies on 
sections were performed as previously described*’. Antisense RNA probe templates 
for Uph or Hand2 were generated using mouse heart cDNA and subcloned into 
pCRII TOPO (Life Technologies). Primer sequences are listed in Extended Data 
Table 1. 

Subcellular fractionation of mouse neonatal cardiomyocytes. Neonatal 
cardiomyocytes were collected from ~50 post-natal day 1 (P1) C57BL/6 mice using 
the Neomyt kit (Cellutron, nc-6031) and fractionated as previously described*?. 
TALEN-mediated homologous recombination in mice. A TALEN pair specific 
for the Uph locus was designed using the ZiFiT Targeter Program (http://zifit. 
partners.org/ZiFiT/Introduction.aspx). Individual TALEN modules and donor 
vectors were constructed and assembled as previously described!*. TALEN 
mRNAs and circular DNA donor plasmids were diluted to 25ng jl! and 3ng yl}, 
respectively, and co-injected into the nucleus and cytoplasm of one-cell stage 
C57BL/6 zygotes and transferred into pseudopregnant mice. Cloning and 
genotyping primer sequences are listed in Extended Data Table 1. 

Histology. Embryos were fixed in 4% formaldehyde in PBS and processed for 
paraffin histology, sectioned and stained with haematoxylin and eosin using 
routine procedures. At least 5 embryos of each genotype were sectioned for all 
experiments and data reported are representative of all samples collected. 

ChIP. ChIP was performed using the SimpleChIP Kit (Cell Signaling, 9004) with 
the following adjustments to the standard protocol. In brief, embryonic hearts were 
dissected from E10.0 embryos and flash frozen in liquid nitrogen. On average, 
5 hearts from identical genotypes were pooled in a 1.5 ml tube before formal- 
dehyde cross-linking. Sonication was performed using a Bioruptor (Diagenode). 
The following antibodies were used: 10 xl H3K4mel (Abcam, ab8895), 1011 
histone H3 (Cell Signaling), 10 ul H3K27ac (Active Motif, 39133), 101 H3K27me3 
(Active Motif, 39155), 5,11 RNA polymerase II CTD repeat YSPTSPS (phospho S2) 
(Abcam, ab5095), 541 of GATA4 (sc-1237X). qPCR primer sequences are listed in 
Extended Data Table 1. 

GapmeR antisense oligonucleotide knockdown of Uph in HL-1 and Neuro2a 
cells. The mouse neuroblastoma cell line Neuro2a was purchased from ATCC 
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and authenticated by morphology. The cardiomyocyte-like cell line HL-1 was 
donated by the laboratory of W. Claycomb. Contamination from mycoplasma 
was not tested for in these cell lines. LNA longRNA GapmeR oligonucleotides 
specific for Uph were designed and purchased from Exiqon. HL-1 and Neuro2a 
cells were co-transfected with a plasmid encoding eGFP (CS2GFEP, 0.3 ng jl! final) 
and 50nM GapmeR antisense oligonucleotides using Lipofectamine2000 (Life 
Technologies), according to the manufacturer’s recommended protocol. Two days 
after transfection, GFP-positive cells were FACS sorted and processed using TRIzol 
(Life Technologies) for downstream gene expression analyses. GapmeR antisense 
oligonucleotide sequences are listed in Extended Data Table 1. 

RACE PCR. The sequences of Uph RNA transcripts expressed in the mouse 
heart were determined using 5’ and 3’ RACE PCR using commercially available 
Marathon-Ready cDNA (Clontech). RACE PCR was performed according to 
manufacturer’s recommended protocol, using primers specific to exon 4 of Uph 
and the marathon adaptor primer 1 (AP1). 5’ and 3’ Uph primer sequences are 
listed in Extended Data Table 1. 

RNA immunoprecipitation. RNA-protein complexes from formaldehyde- 
cross-linked HL-1 cell lysates were immunoprecipitated using 211 IgG (Cell 
Signaling) or 2411 WDRS5 (Cell Signaling) antibodies. RNA purification and 
expression analyses were performed as previously described**. 

Statistics. Unless otherwise indicated in the figure legends, error bars represent 
s.e.m. of at least 3 biological replicates of each genotype or sample. 

Data availability. Expressed sequence tag (EST) sequences for alignment of Uph 
transcripts were downloaded from GenBank: rat (BF567084), pig (DN117952), 
dog (DN434464) and cow (DN282558), or from RefSeq: human (NR_003679.2). 
Previously published ChIP-seq data used in Fig. 4b and Extended Data Fig. 7a, b 
are available under accession code GSE31039 (ref. 35). Source data for Extended 
Data Figs le, 2b, 3c, dare available in Supplementary Fig. 1, and all other data that 
support the findings of this study are available from the corresponding author 
on request. 
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Rat TITGGGCAGCACTATG--CICAGAAGTCATTCAACTTCCIGCCAICICCCTARACTIGIAT 263 
Human TACCACTGC-—-CTICICACTCCAGAAAG—-ACAAACTGC——--—--—--—--—-——-, ‘AGAT 402 
Pig GGIGGA-GCCAGTATGIGAACACAAGC: —CACCITC- -CICIACAGCTCATGI 3596 
Dog TACATCIGC—-TTGIGACTCCAGAAAG--ACAGACTTC— 


Cow TGCGAGRCGAIGICCGCCTCOGGCACT-———-; AGCICI— 
- «@ * * oe * 
Mouse T--—-GGCCAAG-GCAGGICIG—-—-ATTICTAITTATTCTAAR————— GACAGAGTI-—-— 382 


Rat TCICIGGCARAATGAAGGTCIG-—-A-—--GACGTTICIAAAR— 
Human T---—-GOCCAACTGCCIITCAR---ACTGITITTIACTIGCAT—- 
Pig TCTITGARTCATCAGICAATGAAGGCAATGCACCIATCGGGGAG 


—-GACAGAGTI-—-— 308 
GRACARACTCAAC 450 
AGGGGAGTI-AGC 450 


Dog T--—-GGCCRAGGCCCAGTITA-—-AATGCOGCCICCICCA —GA---AGCC-——— 386 
Cow C--—--GCITGGCGC--ATCGGA--AGTTIGICITICIGGAGICCCAGGCAGAGGIGGAC 457 
Mouse ——-GCAGATCCCCTACCATICCICTIGATI-CACATCAC— 
Rat ——-GGARATGAACCATCATIGCIGTIGACA-CGACATGCC— 


Human CAAGAAAATATATCTCGATICTICTATAT—-CGCACAAT— 


Dog = —------ TGTCCAGC--CICCTEGAGCCOCEECCCGAC— 
Cow TGGGAARATCGA-——-——GACCCTCGAGCEGCACCCAGAAC— 
* * *” * 


Mouse TCACC-TGAGCCC--AT---AC-AGCTCCGCTICACAGATCTCGCIAGTCITGCT— 72 
Rat TCACC-TGAGIGC-—RT-—-AT-AGC— ——GTTCTACT 76 
Human CTCCTCTGATTITIAAT—-—-ACTAGCA. —TARBATATARCAATCITITIATI-— 540 
Big TTACCATGCGCITACCATGGCAC-AGCT TCCRCAGR-—-AGCTGCTCCCATTITGCGAA 563 
Deg -GC-EGC---— —--—--—-BCACCECACCACTG: 450 
Cow TIICGCTGCTACTICT: --CCRGRATCATCTGCECITTICACACRCCCTACTTIS- 546 


* ry . * 


Mouse CCACCTCGCACTCTCCA--CATICTACCCAACTCACTGCAGTCC-TICATGCTICGACICA 529 
Rat Oca -CTAGCARR —GCB- -TCACACT-GGAATCA 404 
Human CICG—--IGATRTTCTITI--TIICCTAAATGAA— TGCRTIAGIGIAACRATICCICICA 591 
619 
474 
72 


Pig  ACATGTATCATTATCTTATTICCITGITCAAC— CICATGTICCTACTIC 
GCCTCGGTE- -CCCOGAGCCGAGCCE 47 
GGGGAR- --CACCGATACCGGCCA 57 


Mouse BACKCTTTARCCACTGCACACGITT--—ATGIARTAGGATIGRCATATGCAG: -RICAG 583 


Rat 

Human 

Pig 

Dog f 

Cow CRCRACABRATTCOGGGAATICGAGGIGCCCIG 603 
* 

Mouse BAGARATGAAT---—--GAAR GATTAGAGTGGGAGABAGIGCITGCIGCCAAGCITGAT 635 


Rat TGGATIGAIT- GAR —-GRGARACTIG-— 
Human TGACATGATTICIGIARACICTGACTTIGCICAGAAABARGCTC— 


BABRALCCTT--- 696 


Pig -TARITCTAGGGAARG— ‘BRRGGATART -RGBAICCCGAT 696 
Dog GCEGAGSTG— ‘GCEGNECCC--- 503 
Cow = --CRAGGTCIG-- 627 


Mouse GACCTARATTGAATACCIGGARCCTACACGACACRARGAGAGRGCTIGACTGCIGHAAGAT 695 
Rat 
Human 
Pig 
Dog 


Cow 


698 


Extended Data Figure 1 | Sequence alignment of several mammalian 
Uph transcripts. a, Sequence alignment of several mammalian Uph 
transcripts performed using ClustalW. See Methods for source data. 

b, Diagram of the Hand2 locus in mammals showing the genomic 
organization and orientation of Uph. The Hand2 branchial arch enhancer 
(green) and cardiac enhancer (yellow) are shown. c, Diagram of the Uph 
transcripts expressed in the mouse heart, determined using 5’ and 
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3’ RACE using primers specific to exon 4 of Uph. AP1, marathon adaptor 
primer; 3’GSP, 3’ Uph-specific primer from exon 4; 5’GSP, 5! Uph-specific 
primer from exon 4. d, Whole mount in situ hybridization of E10.5 mouse 
embryos. Expression was detected in heart, branchial arches (arrowhead), 
and limb bud. Scale bars, 1 mm. e, Northern blot analysis of total RNA 
from adult mouse tissues using a probe specific to the major Uph 
transcript. For gel source data, see Supplementary Fig. 1. 
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Extended Data Figure 2 | Uph is a cytoplasmic IncRNA. a, Subcellular myoregulin (MLN) micropeptide was used as a positive control, and 
fractionation of 18S, Uph and Malat1 IncRNA in mouse neonatal myoregulin with a frameshift mutation (MLN RNA-FS) was used as a 


cardiomyocytes (n = 3 biological replicates from 1 of 5 independent 
experiments; mean + s.e.m.). b, In vitro transcription and translation 
of a plasmid encoding the major Uph RNA. A plasmid encoding the 


negative control. In contrast to myoregulin, Uph and the negative control 
(MLN RNA-FS) did not produce any detectable proteins, indicating that 
Uph is a bona fide IncRNA. For gel source data, see Supplementary Fig. 1. 
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Extended Data Figure 3 | Targeting strategy for insertion of 
transcriptional termination or heterologous sequence into exon 2 of 
Uph. a, Uph KO targeting strategy. Transcription activator-like effector 
nucleases (TALENs) were used to insert a triple polyadenylation (tpA) 
termination sequence into exon 2 (E2) of Uph. b, Using the same TALEN 
pair as in a, we introduced the coding sequence of tdTO, lacking a 
polyadenylation sequence, into exon 2 of the Uph locus. Exon 2, which 
includes the tdTO coding sequence, was spliced out of the mature Uph 
transcript, preventing expression of tdTO in these mice. c, Southern blot 
analysis of wild-type, heterozygous Uph*/~ (Het) and Uph KO genomic 
DNA. BamHI-digested DNA hybridized with a 5’-specific probe and 
Ndel-digested DNA hybridized with a 3’-specific probe. For gel source 
data, see Supplementary Fig. 1. d, Southern blot analysis of wild-type, 
Uph'*T0'+ heterozygous and Uph'*T0"4T© knock-in (KI) genomic DNA 
verified the correct targeting of the tdTO sequence into the Uph locus. 
DNA was digested with BamHI and hybridized with a 5’-specific probe or 
digested with Ndel and hybridized with a 3’-specific probe. For gel source 
data, see Supplementary Fig. 1. e, f, Expression of Uph (e) and Hand2 (f) 
in wild-type and Uph'4?0/'¢T0 homozygous mice at E10.0 was not changed 
by the insertion of the tdTO sequence into exon 2 of the Uph locus (n= 3, 
representative of 3 independent experiments; mean + s.e.m.). g, GPCR 
shows Uph transcripts decreased by ~97% in E10.5 hearts (n = 3 mice of 
each genotype from 1 of 3 independent experiments; mean + s.e.m.). 
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Extended Data Figure 4 | Aortic arch arteries are normal in Uph KO 
embryos. a, qPCR quantification of gene expression at E10.0 showed 
robust downregulation of Uph and Hand2 expression in Uph KO hearts, 
with normal expression of other cardiac transcription factors (n = 3 mice 
of each genotype from 1 of 3 independent experiments; mean + s.e.m.). 

b, India ink was injected into either the left ventricle of wild-type embryos 
or the single ventricle of Uph KO embryos at E10.5, to visualize the aortic 
arch arteries and circulation, which appeared normal in Uph KO embryos. 
aa, aortic arch arteries; as, aortic sac; da, dorsal aorta. Scale bars, 1 mm. 
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Extended Data Figure 5 | Upht/— Hand2*'— compound heterozygotes 
recapitulate Uph KO phenotype. a, Uph*/~ Hand2*'~ double 
heterozygote embryos developed a single ventricle, pericardial effusion, 
and were severely growth restricted by E11.5. Scale bars, 1 mm. b, Uph 
expression is normal in double heterozygotes, whereas Hand2 was reduced 
by ~90%, relative to wild-type embryos (n=5 mice for wild type, Hand2 


UPH +/- ; Hand2 +/- 


® Wild type 
= Hand2 Het 
4 UPH Het 
Y Double Het 
* Hand2 KO 


het and Uph het, n = 3 for double het, n =2 for Hand2 KO, from 1 of 

2 independent experiments; mean +s.e.m.). c, Inmunoprecipitation of 
RNA using either IgG or WDRS in HL-1 cells followed by qPCR for Uph 
revealed no binding to WDR5. The WDRS5-interacting ncRNA HOTTIP 
was used as a positive control (n = 3 biological replicates from 1 of 3 
independent experiments; mean + s.e.m.). 
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Extended Data Figure 6 | Knockdown of mature Uph transcripts in qPCR (n=2 for ASO A and B, n=3 for control, from 1 of 2 independent 
HL-1 or Neuro2a cells does not affect Hand2 expression. a, Expression experiments; mean +s.e.m.). e, Fractionation of HL-1 cells transfected 
of Uph (red) and Hand2 (blue) in various tissues and cell lines. Uph and with control or Uph-specific ASOs. The nuclear fraction of antisense- 


Hand2 are robustly expressed in the hea 


lines. Uph and Hand2 transcripts are not expressed in the liver, 10T1/2 


rt, and the HL-1 and Neuro2a cell oligonucleotide-treated HL-1 cells showed a similar downregulation 


of Uph transcripts (n = 3 biological replicates from 1 of 3 independent 


fibroblasts or skeletal muscle C2C12 myotubes (n = 3 technical replicates; experiments; mean + s.e.m.). f, Overexpression of enhanced green 


mean + s.e.m.). b, c, Uph transcripts were reduced by ~90% in HL-1 (b) 
and Neuro2a (c) cells when transfected with two independent GapmeR 


antisense oligonucleotide probes (ASO A and B) against Uph. Expression Hand2 expression relative to wild 
of Hand2 was not changed (n =2 for ASO A and B, n=3 for control, from _ the expression of Uph in these cells (n = 2 biological replicates from 1 of 2 
t s.e.m.). d, Uph transcripts were independent experiments; mean +s.e.m.). 

similarly downregulated across each exon-exon junction, measured using 


1 of 2 independent experiments; mean 4 
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Extended Data Figure 7 | ChIP and qPCR analyses of active cardiac 
enhancers regulating Nkx2-5 expression. a, Diagram of the Nkx2-5 locus 
with numbers (1 and 2) indicating the region analysed by qPCR following 
ChIP. Shown in red are the ENCODE/LICR H3K4mel active enhancer 
marks in the heart at E14.5. See Methods for source data. b, ChIP coupled 
with qPCR analysis of H3K4me1 marks, normalized to total histone H3, 
showed that the H3K4me1 marks bound to the Nkx2-5 promoter region 
are unchanged between wild-type, Uph KO and Uph'47°/«T° homozygous 
hearts at E10.0. c, ChIP coupled with qPCR analysis of H3K27ac marks, 
normalized to total histone H3, showed that the H3K27ac marks bound 
to the Nkx2-5 promoter region are unchanged between wild-type, Uph 
KO and Uph'*!9/'eT° homozygous hearts at E10.0. d, GATA4 binding to a 


GATA4 site in the Nkx2-5 promoter is unchanged between wild-type and 
Uph KO hearts at E10.0. e, Diagram of the Uph-Hand2 locus, with black 
bars indicating the regions analysed by qPCR following ChIP. Shown in 
red is the ENCODE/LICR H3K27me3 track for mouse heart. See Methods 
for source data. The branchial arch enhancer (green) and cardiac enhancer 
(yellow) are shown. f, ChIP coupled with qPCR analysis of the polycomb 
repressive marker H3K27me3, normalized to total histone H3, showed no 
differences between genotypes at each locus (1-3) indicated in the diagram 
in e. g, qPCR revealed no difference in the levels of Ser2-phosphorylated 
RNAPII between genotypes at the Nkx2-5 gene body. Each point is one of 
3 technical replicates of 5 pooled hearts for each genotype in each ChIP 
experiment, from 1 of 2 independent experiments; mean +s.e.m. 
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Extended Data Table 1 | Relevant sequences 


SEQUENCE NAME 
Full Length UPH sequence 


UPH qPCR primers 

UPH ISH probe 

Hand2 ISH probe 

UPH targeting 5'arm 
UPH targeting 3'arm 
UPH 5' WT genotyping 
UPH 5' KO genotyping 
UPH 3' WT genotyping 
UPH 3' KO genotyping 
tdTO 5' WT genotyping 
tdTO 5' KO genotyping 
tdTO 3' WT genotyping 
tdTO 3' KO genotyping 
Southern BamHI probe 
Southern NdeI probe 
Handl ISH probe 
Nkx2-5 ISH probe 

ASO negative control 
ASO UPH A 

ASO UPH B 

5GSP1 

3GSP1 

AP1l 

Hand2 - Gata site 
Nkx2-5 - Gata site 
UPH ChIP site 1 

UPH ChIP site 2 

UPH ChIP site 3 
Nkx2-5 ChIP - site 1 


FORWARD 


CACTCATAACCATAAGATAATTAAAACGG 


CATTCTCGAGCAATTCGTCA 


GATGAGACCTTCAGTTTGTGCC 


ATGAGTCTGGTGGGGGGC 


TATCGGAGCTCGCACCTCGGAGCTGGGAA 
TATCGGCTAGCCATATGACCCTAACAGAGATTGCGAAGA 


CTCCTCTCCGGACAAGAATC 
CTCCTCTCCGGACAAGAATC 
AGAGAACGCGGATGAGACCT 
GACCTGCAGCCCAAGCTA 
CTCCTCTCCGGACAAGAATC 
CTCCTCTCCGGACAAGAATC 
AGAGAACGCGGATGAGACCT 
GACCTGCAGCCCAAGCTA 
TGGTTTTCTTGTCGTTGCTG 
TCCTGGGAAGGCACTATGTC 
CCATCATCACCACTCACACC 
TATGGCTACAACGCCTACCC 
AACACGTCTATACGC 
GCTAGTTAAGCAGGA 
ATTCAATTTAGGTCAT 


CCATCCTAATACGACTCACTATAGGGC 


TTTACCCACTGGTCCCCTCT 
CTGCAACTATCACCCGGAAT 
TCACCTCCCCATGTCTTTTC 
ACCTCGGGCTTTCGATCTTA 
GAAACTAGCCTTGCCCCTTC 


ACTGTGAAGCCCAATTCCAG 


REVERSE 


TTTAAAAAATAATTTTTAATATACTATGTGCATGGTTGGATAGGT 


TGGTAGCCCATCTCCAACTC 


ATACTATGTGCATGGTTGGATAGGT 


TCACTGCTTGAGCTCCAGG 


GATACGCGGCCGCGGATCCAGTTGTCATCCTAACTTGGGTCA 
GATACATCGATCAGGGCAGTTAGGTCTCAGC 


TGCTGCAAATGAGTGTGG 
GGTTCCGGATCCACTAGTTCT 
CCCTTGCAAACAGAAGAAAGG 
CCCTTGCAAACAGAAGAAAGG 
TGCTGCAAATGAGTGTGG 
ATGACCTCCTCGCCCTTG 
CCCTTGCAAACAGAAGAAAGG 
CCCTTGCAAACAGAAGAAAGG 
CTGACTGGGTCCTTGAGCAT 
ACCTTCTCCTGCCCTTCATT 
GCGCCCTTTAATCCTCTTCT 
GTGTGGAATCCGTCGAAAGT 


CAGCTGTATGGGCTCAGGTGACTGC 
TGGAGATGGGCTACCATTGGTGTTGA 


TGGACAACATGGGACAGAAA 
AGAAACCCCCATCTGTTTCC 
GAGGAACCTGCATTGCTTTC 
GCTTGGGAAGGTAAGCCTTT 
GGGTGCCTAGGGAGGAATAC 


AACCAGAAATTGTGGCAAGG 
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PI3K~ is a molecular switch that controls immune 


suppression 


Megan M. Kaneda!, Karen S. Messer’, Natacha Ralainirina!, Hongying Li!?, Christopher J. Leem!, Sara Gorjestani!, 
Gyunghwi Wool, Abraham V. Neuyen!, Camila C. Figueiredo’, Philippe Foubert!, Michael C. Schmid!, Melissa Pink‘, 

David G. Winkler*, Matthew Rausch’, Vito J. Palombella‘, Jeffery Kutok*, Karen McGovern’, Kelly A. Frazer*°, Xuefeng Wu’, 
Michael Karin’, Roman Sasik®, Ezra E. W. Cohen? & Judith A. Varner!?»!° 


Macrophages play critical, but opposite, roles in acute and chronic 
inflammation and cancer. In response to pathogens or injury, 
inflammatory macrophages express cytokines that stimulate 
cytotoxic T cells, whereas macrophages in neoplastic and parasitic 
diseases express anti-inflammatory cytokines that induce immune 
suppression and may promote resistance to T cell checkpoint 
inhibitors’. Here we show that macrophage PI 3-kinase + controls a 
critical switch between immune stimulation and suppression during 
inflammation and cancer. PI3K~ signalling through Akt and mTOR 
inhibits NF«B activation while stimulating C/EBP® activation, 
thereby inducing a transcriptional program that promotes immune 
suppression during inflammation and tumour growth. By contrast, 
selective inactivation of macrophage PI3K~ stimulates and prolongs 
NF«B activation and inhibits C/EBP® activation, thus promoting 
an immunostimulatory transcriptional program that restores CD8+ 
T cell activation and cytotoxicity. PIS3K~y synergizes with checkpoint 
inhibitor therapy to promote tumour regression and increased 
survival in mouse models of cancer. In addition, PI3K~-directed, 
anti-inflammatory gene expression can predict survival probability 
in cancer patients. Our work thus demonstrates that therapeutic 
targeting of intracellular signalling pathways that regulate the 
switch between macrophage polarization states can control immune 
suppression in cancer and other disorders. 

We investigated the association between immune responses and 
survival in primary tumours from human papilloma virus (HPV) * 
(n=97) and HPV” (n= 423) head and neck squamous cell carci- 
noma (HNSCC) cohorts from the Cancer Genome Atlas®?. High 
expression levels of pro-inflammatory mRNAs IL12A, IL12B, IFNG 
and CD8A were associated with increased survival in HPV~ but not 
HPV cohorts, whereas high expression of IL6 was negatively asso- 
ciated with survival (Extended Data Fig. la—e). HPV* patients with 
this favourable immune expression profile (n =35) had 97% survival 
at 3 years compared with 57% survival for patients without this profile 
(n= 62) (Fig. 1a). Similar associations were observed in lung adeno- 
carcinoma and gastric carcinoma patients (Extended Data Fig. 1f, g). 
These results suggest that therapeutic approaches that stimulate pro- 
inflammatory gene expression might enhance cancer patient survival. 

We hypothesized that macrophage signalling pathways, such as 
those regulated by the class IB isoform PI3K*, might control the switch 
between immune stimulation and suppression in inflammation and 
cancer. PI3K~y is abundantly expressed in myeloid cells but not in cancer 
cells'-!3 (Extended Data Fig. 1h) and promotes myeloid cell trafficking 
during inflammation and cancer’?"'®, Mice lacking PI3K~ (Pik3cg/~) 
had exaggerated, macrophage-mediated pro-inflammatory responses 


upon exposure to pathogenic stimuli (Fig. 1b and Extended Data 
Fig. 1i-k), suggesting that PI3K~ inhibits macrophage inflammatory 
responses and might also do so in the tumour microenvironment. 
Mice lacking PI3K~ and mice that were treated with PI3K~ antago- 
nists (TG100-115 (ref. 13) or IPI-549 (ref. 17)) exhibited significantly 
(P< 0.05, one-way ANOVA with Tukey’s post-hoc test) suppressed 
growth of implanted HPV*t (MEER, mouse HPV* HNSCC cell line) 
and HPV” (SCCVII, mouse HPV” HNSCC cell line) HNSCC, lung 
carcinoma (LLC), and breast carcinoma (PyMT) tumours (Fig. 1c and 
Extended Data Fig. 11). PI3Ky inhibition did not directly affect the 
growth or survival of tumour cells, which do not express the kinase*-16 
(Extended Data Fig. 1h, m). PI3K~ inhibition suppressed long-term 
growth and metastasis of spontaneous breast tumours, extended 
survival of mice with orthotopic breast tumours and enhanced 
the sensitivity of tumours to the nucleoside analogue gemcitabine 
(Fig. 1d and Extended Data Fig. 1n—q). Although PI3K~4 inhibition 
did not affect the accumulation of CD11b*-Gr1~—F4/80* tumour- 
associated macrophages (TAMs) in tumours (Extended Data Fig. 2a-f), 
it enhanced the expression of major histocompatibility complex class II 
(MHCII) and pro-inflammatory cytokines and inhibited the expression 
of immune-suppressive factors in tumours and TAMs, indicating that 
PI3Ky controls the TAM switch between immune suppression and 
immune stimulation (Fig. le-g and Extended Data Figs 2g-j, 3a-f). 

To determine whether PI3Ky directly regulates macrophage 
polarization, we analysed mRNA and protein expression in primary 
mouse macrophages stimulated in vitro with basal medium con- 
taining colony stimulating factor 1 (CSF-1) or in pro-inflammatory 
(IFN4, lipopolysaccharide (LPS) and CSF-1) or anti-inflammatory 
(interleukin-4 (IL4) and CSF-1) conditions. Pro-inflammatory stimuli 
upregulated macrophage expression of innate immune proteins, 
cytokines and cell-surface receptors, whereas anti-inflammatory 
stimuli induced expression of immunosuppressive factors similar 
to those expressed in TAMs! (Extended Data Fig. 4a—c). Genes and 
proteins associated with immune activation, antigen presentation 
and T cell activation were upregulated in Pik3cg-'~ and PI3K~- 
inhibitor-treated macrophages (Fig. 2a—c and Extended Data Figs 4c-f, 
5a-g). By contrast, genes associated with immune suppression and 
chemoattraction were inhibited (Fig. 2a—c and Extended Data Figs 4c-f, 
5a-g). These results confirm that PI3K~ controls a macrophage switch 
between immune stimulation and suppression. 

To investigate how PI3K~ regulates macrophage immune responses, 
we analysed the DNA-binding activities of NFkB p65-RelA and the 
CCAAT/enhancer binding protein C/EBP®8 in wild-type and Pik3cg- 
null macrophages, as NFkB promotes expression of inflammatory 
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Figure 1 | PI3K~y promotes immune suppression. a, Multivariate 
immune response mRNA signature in HPV* HNSCC patients. n = 97 
biological replicates; P= 0.0001; log-rank test. b, mRNA expression of 
genes involved in the immune response in Pik3cg-/~ or wild-type (WT) 
peritoneal macrophages. n= 3 biological replicates; *P = 0.01; t-test. 

c, Tumour volumes in wild-type, Pik3cg™! ~ and PI3Ky (PI3K~i)-inhibitor- 
treated mice (n= 15 biological replicates) P< 0.05, one-sided ANOVA 
with Tukey’s post-hoc test. Arrow shows time of initiation of daily 
treatment with the PI3Ky inhibitor. d, Tumour weight (P = 0.000001) and 
number of lung metastases (P= 0.007) in mice with PyMT spontaneous 
breast carcinoma (scale bar, 200 1m) in wild-type (n = 21 biological 


cytokines'®, whereas C/EBP promotes expression of the immunosup- 
pressive factor arginase 1 (Arg], refs 19,20). PI3Ky ablation rapidly 
and sustainably stimulated RelA DNA-binding activity in macrophages 
(Fig. 2d and Extended Data Fig. 5h). By contrast, PI3Ky ablation 
suppressed DNA-binding activity of C/EBP® (Fig. 2e). Consistent with 
these findings, PI3Ky inhibition stimulated and sustained p65-RelA 
phosphorylation and simultaneously inhibited C/EBPG and Akt phos- 
phorylation (Fig. 2f, g and Extended Data Fig. 6a, b). 

Subsequently, we examined the effect of PI3Ky inhibition on the 
stability and phosphorylation of proteins that activate NFkB, including 
the TLR4-associated adaptor protein IRAK-1 and inhibitory « B kinase (, 
IKK8, which promotes degradation of IkBa, with subsequent release 
of NF«B from an inhibitory IKB-NFKB complex'®. PI3Ky deletion 
enhanced phosphorylation of IKK and TBK1 and degradation of 
IRAK-1 and IkBa in LPS-stimulated Pik3cg /~ macrophages (Fig. 2h). 
As an IKK® inhibitor suppressed the inflammatory phenotype 
observed in Pik3cg-/~ macrophages (Fig. 2i), these results indicated 
that PI3K~ was both a feedback inhibitor of the TLR4-NFxB activation 
pathway and a promoter of IL4 and C/EBP§ signalling. 
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replicates) and Pik3cg~'~ (n =8 biological replicates) backgrounds (t-test). 


e, MHCII expression in wild-type or Pik3cg-'~ TAMs. n= 3 biological 
replicates; P= 0.01 for both lung and breast tissue, as analysed by t-test. 

f, Fold change of mRNA expression in tumours and tumour-derived 
CD11b* cells from Pik3cg~/~ and PI3K+-inhibitor-treated mice. n=5 
biological replicates; P< 0.01; t-test. g, Heat map of median-centred 
mRNA expression of genes involved in the immune response in tumours 
from wild-type and Pik3cg~/~ mice. n = 3 biological replicates; P= 0.01; 
t-test. All experiments were performed two or more times. b, d, e, Data are 
shown as mean + s.e.m. 


C/EBP® has previously been linked with tumour immune sup- 
pression through its control of Arg! expression'®”°. Expression of 
constitutively activated PI3K~ (Pik3cgCAAX)"” was sufficient to induce 
Arg expression in a manner that was inhibited by Cebpb, Mtor, or 
Rps6kb1 (also known as S6ka) knockdown (Extended Data Fig. 6c, d). 
Both Cebpb knockdown and inhibition of S6Ka or mTOR by inhibitors 
suppressed expression of immune suppressive factors and stimulated 
expression of pro-inflammatory cytokines (Extended Data Fig. 6e-g). 
These results show that PI3Ky promotes immune suppression by 
activating mTOR-S6Ka-C/EBP® and inhibiting NFKB, thereby 
controlling a switch that regulates the balance between immune 
suppression and stimulation. 

Since PI3K+ blockade stimulated pro-inflammatory responses in 
macrophages, we investigated whether PI3K~ blockade in macrophages 
promoted adaptive immunity. TAMs were isolated from wild-type and 
Pik3cg~'~ tumours, mixed with tumour cells and adoptively transferred 
into new wild-type or Pik3cg/~ recipient mice (Fig. 3a). Tumour 
growth was significantly inhibited (P=0.01) in tumours containing 
Pik3cg-'~ TAMs but not wild-type TAMs (Fig. 3b). The number of 
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Figure 2 | PI3K~y regulates NFkB and C/EBP® during macrophage 
polarization. a, Heat map of mean fold-change in gene expression of 
Pik3cg~'~ versus wild-type macrophages stimulated in mCSE, IFN>/LPS 
or IL4 conditions (n = 3 biological replicates) for genes that are 
significantly differently expressed (P= 0.00001) in these two genotypes. 
b, mRNA expression of indicated genes in wild-type or Pik3cg~!~ 

mice. n = 3 biological replicates; *P = 0.01; t-test. c, Expression levels 

of indicated proteins in wild-type and Pik3cg-'~ mice. Data are 

mean +s.e.m.; n = 4 biological replicates; *P = 0.01 or **P=0.001; t-test. 


CD8* T cells was significantly increased (P= 0.0001) in tumours with 
Pik3cg'~ but not wild-type macrophages (Fig. 3c and Extended Data 
Fig. 6h), indicating that PI3Ky signalling in TAMs inhibits CD8* T 
cell recruitment to tumours. To determine whether macrophage-de- 
rived cytokines controlled tumour growth, we implanted tumour cells 
mixed with in vitro cultured macrophages or in conditioned medium 
from cultured macrophages into wild-type mice. Tumour growth was 
enhanced by IL4-stimulated wild-type macrophages and conditioned 
medium from wild-type macrophages, but inhibited by IL4-stimulated 
Pik3cg-‘~ macrophages and conditioned medium from Pik3cg '~ 
or PI3K~-inhibitor-treated macrophages and by all LPS-stimulated 
macrophages and conditioned medium from LPS-stimulated 
macrophages (Fig. 3d, e). To determine which macrophage-derived 
immune factors affect tumour growth in vivo, we treated wild-type and 


d, e, p65 RelA (d) and C/EBP8 (e) DNA-binding activity in wild-type and 
Pik3cg-'~ macrophages. n = 4 biological replicates; *P=0.01 (d) and 
*P=0.04 (e), t-test. f, g, Immunoblotting of pRelA and RelA, pC/EBP8 and 
C/EBP®, and pAkt and Akt in LPS- (f) and IL4-stimulated (g) wild-type and 
Pik3cg! ~ macrophages. h, Immunoblotting of IRAK1, pIKK®6 and IKK8, 
IkBa, pTBK1 and TBKI1, PI3Ky and actin in wild-type and Pik3cg/~ 
macrophages. i, mRNA expression in IKK8-inhibitor-treated macrophages. 
n=3 biological replicates; *P=0.01 or **P=0.001; t-test. All experiments 
were performed two or more times. Data are mean + s.e.m. 


Pik3cg! ~ TAMs ex vivo with inhibitors of mTOR, Arginasel, IKK, 
IL12 or nitric oxide synthase 2 (NOS2) before mixing with tumour cells 
and implanting in mice (Extended Data Fig. 6i). Blockade of mTOR 
or Arginasel in wild-type macrophages suppressed tumour growth, 
while inhibition of NOS2, IL12 or IKK@ in Pik3cg-'~ macrophages 
stimulated tumour growth. These results indicate that PI3Ky¥-mTOR 
mediated immune suppression promotes tumour growth and that 
PI3K~ inhibition reverses these effects by shifting macrophages towards 
NF«B-dependent pro-inflammatory polarization. 

To investigate further whether macrophage PI3Ky controlled 
tumour growth, mice bearing pre-established tumours were treated 
with PI3K7 inhibitors in combination with clodronate liposomes, 
which deplete macrophages from tissues”!. Separate PI3K~+ inhibitor 
and clodronate liposome treatment each partially inhibited tumour 
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Figure 3 | Macrophage PI3K~y suppresses T cell activation. a, Adoptive 
transfer method. b, Weights of tumours resulting from tumour cells 
implanted together with Pik3cg-/~, wild-type or no TAMs. M®, 
macrophages. n = 8 biological replicates; P= 0.01; one-sided ANOVA with 
Tukey’s post-hoc test). c, CD8* T cells per mm? from b. n= 8 biological 
replicates with 2 technical replicates each; P= 0.001; one-sided ANOVA with 
Tukey’s post-hoc test). d, e, Weights of tumours resulting from tumour cells 
implanted together with in vitro cultured macrophages (d, n= 8 biological 
replicates; P=0.01 and P=0.0001, as indicated) or with macrophage- 
conditioned medium (e, n= 8 biological replicates; P=0.01 and P= 0.0001, 
as indicated). f, Percentage of CD4* and CD8* T cells (out of the total 
number of CD3* cells; *P=0.001) and percentage of CD3* T cells 


growth and stimulated T cell recruitment, but the combination had 
no additive effects, confirming that PI3K7 expression in macrophages, 
rather than in other cell types, promotes tumour growth (Extended 
Data Fig. 7a-d). Similar results were observed when CSF1R inhibition” 
and PI3K~ inhibition were combined (Extended Data Fig. 7e, f). 
PI3Ky blockade stimulated T cell recruitment into tumours, as 
total and CD8* T cell content increased in tumours from Pik3cg”/~ 
mice without significantly altering systemic T cell content (Fig. 3fand 
Extended Data Fig. 7g-i). PI3K inhibition did not suppress tumour 
growth in CD8-null or antibody-depleted mice, suggesting that PI3Ky 
inhibition blocked tumour growth by recruiting and/or activating 
CD8* T cells (Fig. 3g and Extended Data Fig. 7j-k). When T cells were 
isolated from tumour-bearing or naive mice, mixed with tumour cells 
and implanted in mice, only T cells from Pik3cg-'~ tumour-bearing 
mice suppressed tumour growth (Fig. 3h). However, PI3Ky 
inhibition did not directly activate T cells, as neither PI3K~ deletion 
nor treatment of T cells with PI3K~ inhibitors ex vivo affected T cell 
proliferation or activation; in contrast, PI3K6 inhibition suppressed 
T cell activation in vitro and promoted tumour growth in vivo 
(Fig. 3i and Extended Data Figs 71, m, 8a, b). PI3Ky inhibition 
promoted T cell-mediated cytotoxicity, as T cells isolated from 
Pik3cg'~ or PI3K+-inhibitor-treated tumours stimulated tumour-cell 
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(out of the total number of live cells; **P=0.01; t-test) in wild-type and 
Pik3cg~'~ tumours (n =3 biological replicates). g, Tumour volumes in 
Pik3cg-'~ and/or Cd8~'~ mice. n= 6 biological replicates; P < 0.03, P< 0.001, 
as indicated; one-sided ANOVA with Tukey’s post-hoc test. h, i, Weights of 
tumours implanted with naive or tumour-derived T cells (h, n= 8 biological 
replicates; P< 0.001; t-test) or inhibitor-treated T cells (i, n = 16 biological 
replicates; *P < 0.01; **P < 0.0001; one-sided ANOVA with Tukey’s post-hoc 
test). j, Protein concentrations of IFNy (*P = 0.001; t-test) and granzyme B 
(P< 0.03, t-test) in tumours and T cells from wild-type and Pik3cg"/~ mice 
(n=3 biological replicates; NS, not significant). All data are shown as 

mean + s.e.m. and all experiments were performed two or more times. 


cytotoxicity (Extended Data Fig. 8c-g). T cells from Pik3cg”'~ or 
PI3K~-inhibitor-treated mice expressed significantly more IFNy and 
granzyme B (P=0.001 and P< 0.03, respectively) and significantly 
less TGFB1 and IL10 protein (P=0.008 and P= 0.03, respectively) 
and mRNA (P< 0.02) than T cells from wild-type mice (Fig. 3j and 
Extended Data Fig. 8h-1). Together, these results indicate that PI3Ky 
inhibition in macrophages indirectly promotes both Th] and cytotoxic 
adaptive immune responses. 

To investigate whether PI3K~ inhibition interacts with other immune 
therapies, we combined PI3K~ and the checkpoint inhibitor anti-PD-1 
in mouse tumour models. PD-L1, but not PD-L2, was expressed in 
macrophages in vitro and in vivo (Extended Data Fig. 9a, b). PI3Ky 
inhibition synergized with anti-PD-1 to suppress the growth of HPV* 
HNSCC tumours in Pik3cg-'~ or inhibitor-treated male or female 
mice, inducing tumour regression in 86% of male and 90-100% of 
female mice, as well as continuous survival to date in 60% of male 
mice and 90-100% of female mice (Fig. 4a—c and Extended Data 
Fig. 9c, d). Notably, PI3K inhibition also synergized with anti-PD-1 
to reduce tumour growth, extending survival and inducing tumour 
regression in 30% of mice bearing HPV HNSCC (SCCVII) tumours 
(Fig. 4d—-f and Extended Data Fig. 9e). The combination of PI3Ky and 
anti-PD-1 inhibitors activated T cell memory, as 100% of mice that 
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Figure 4 | PI3K~ inhibition synergizes with anti-PD-1. a, d, Tumour 
volumes in anti-PD-1 (black arrows) treated wild-type or Pik3cg~/~ mice 
with HNSCC HPV* tumours (a, n = 13 biological replicates) and PI3Ky- 
inhibitor (TG100-115)-treated mice with HPV” HNSCC tumours 

(d, n= 13 biological replicates). ****P = 0.0001; **P= 0.01; one-sided 
ANOVA with Tukey’s post-hoc test. b, e, Per cent survival of mice in a and 
d. c, f, Mean change in tumour volumes from individual mice from a and 
d. g, h, Median-centred heat map of whole tumour mRNA expression (g) 
and mRNA expression of select genes (h) from whole tumours of mice 


had previously cleared HPV* tumours efficiently suppressed tumour 
growth when re-challenged with HPV~* tumour cells and remained 
cancer-free (Extended Data Fig. 9f). PI3Ky and PD-1 inhibitors each 
stimulated immune response gene expression and inhibited immune 
suppressive gene expression, MHCII expression in TAMs and CD8* 
T cell recruitment to tumours; and the combination of therapies 
further elevated these parameters (Fig. 4g-i and Extended Data Fig. 
9g). These studies showed that PI3Ky inhibition can synergize with 
T-cell-targeted therapy to promote anti-tumour immune responses 
that induce sustained tumour regression in mouse models of cancer. 
PI3Ky-regulated immune responses might also affect outcome 
in cancer patients. We identified 43 PI3K~-regulated genes that are 
significantly (P < 0.05) associated with survival in the Cancer Genome 
Atlas HPV* HNSCC patients (Fig. 4j). HPV* HNSCC patients (n = 34) 
with a low PI3K~-activity profile showed 100% survival at 3 years, 
compared to 56% survival for the remaining 63 patients (Fig. 4k). 
In HPV” HNSCC patients, 39 of these genes were significantly 
(P <0.05) shifted in the direction of high PI3K~ activity, consistent 
with a pattern of pervasive immune suppression and reduced survival 


from d. n = 3 biological replicates; *P = 0.01; **P=0.001; ***P= 0.0001; 
one-sided ANOVA with Tukey’s post-hoc test. i, Representative immune 
cell flow-cytometry profiles from mice from d. j, Heat map of PI3Ky- 
regulated mRNA expression in HPV* HNSCC patients (n = 45 patients, 
P<0.05; log-rank test). k, 1, Multivariate PI3K~-regulated immune 
signature in HPV*HNSCC patients (k, n =97; P< 0.001; log-rank test) 
and lung adenocarcinoma patients (1, n = 507; P < 0.001; log-rank test). All 
experiments were performed two or more times. a, d, h, Data are shown as 
mean = s.e.m. 


in HPV~ disease (Extended Data Fig. 10a). In lung adenocarcinoma 
patients, 18 genes predicted survival; 202 patients with a low PI3Ky- 
activity profile had 73% survival at 3 years, compared to 55% survival 
for 305 patients with a high PI3K~-activity profile (Fig. 41). These 
results suggest that a PI3K~-regulated immune suppression signature 
is associated with survival in cancer patients and that PI3K~ inhibitors 
might provide clinical benefits for cancer patients. 

Here we have shown that PI3K7 regulates innate immunity 
during inflammation and cancer (Extended Data Fig. 10b, c). Prior 
studies have implicated PI3Ks in the regulation of pro-inflammatory 
immune responses in macrophages, as pan-PI3K inhibitors and null 
mutations in the PI3K~ effectors PDK1, Aktl and TSC enhanced pro- 
inflammatory NFKB-dependent transcription in macrophages”***, 
while inhibition of PTEN and SHIP, which oppose PI3K function, 
promotes immune suppression”®”’. As macrophage reprogrammin 
can enhance the activity of checkpoint inhibitors in cancer™!422,28) 
our studies indicate that inhibitory targeting of macrophage signalling 
pathways may provide novel approaches to improve the long-term 
survival of cancer patients. 
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METHODS 
Immune-related gene expression signature analysis in the Cancer Genome Atlas 
(TCGA) data. We analysed TCGA data for association between mRNA expression 
level of 16 candidate immune-related genes (ARG1, IL10, FOXP3, CD68, IL12A, 
IL12B, IFNG, CD8A, CD4, ITGAM (also known as CD11B), CD14, TNF, IL1A, 
IL1B, IL6 and CCL5) and 5 year overall survival. lumina HiSeq RNaseqV2 mRNA 
expression and clinical data for 520 head and neck squamous cell carcinoma 
samples were downloaded from the TCGA data portal. Median follow-up from 
diagnosis was 1.8 years with a range of 0.01 years to 17.6 years. Follow-up time was 
truncated at 5-years for analysis and 200 deaths occurred in this period. For each 
of the 16 candidate immune response genes, we scored subjects as above (high) 
or below (low) the median expression and compared survival using a log-rank 
test at 5% significance. HPV* patients were stratified into a favourable immune 
profile if they had expression above the median for the significant genes IL12A, 
IL12B, IFNG, CD8A and below the median for IL6. Kaplan-Meier curves were 
plotted for these two groups. Similar methods were used to examine association of 
these 16 genes with 720 lung adenocarcinoma and 876 gastric carcinoma samples 
using the publically available data from KM Plotter”’. In lung adenocarcinomas, 
12 genes were significantly associated with survival; patients were scored as having 
a favourable immune profile if 7 or more of the 12 significant genes had expression 
in the favourable direction. In 876 gastric cancer samples, 8 genes were significantly 
associated with survival. Patients were scored as having a favourable immune 
profile if 5 out of the 8 genes had expression in the favourable direction. 
PI3K~-regulated gene expression signature analysis in TCGA data. We inves- 
tigated 66 immune-related genes in four functional classes, 17 genes related to 
antigen presentation (HLA class I and II molecules), 24 genes surveying T cell 
activation, 20 innate immune response genes (IL6, CCL7 and others) and 5 genes 
related to cancer cell signalling. These genes changed expression in response to 
PI3K~ inhibition for association with survival in HPV* and HPV- TCGA HNSCC 
and lung adenocarcinoma cohorts. Within each cancer type, we scored subjects as 
above or below the median expression for each gene and compared survival using a 
log-rank test, using 10% false discovery rate (FDR) within each class as the signifi- 
cance threshold. HPV* and HPV~ HNSCC survival were investigated separately, 
as HPV” HNSCC generally has a worse prognosis. Within each cohort, patients 
were classified as having a favourable PI3Ky immune response profile if they had 
expression levels above or below the median in the direction of low PI3K~ activity 
for the genes identified as significant. We compared the survival experience of 
favourable versus less-favourable profiles of patients using Kaplan-Meier curves. 
Out of the 66 experimentally identified PI3K--regulated genes, 43 showed signifi- 
cant association with overall survival in the HPV~ cohort (FDR < 10% within each 
functional class). Comparison of these genes between HPV* and HPV” cohorts 
showed that HPV~ samples generally had significantly (P < 0.05) lower expression 
of 42 genes in the antigen presentation and T cell activation classes, consistent with 
a pattern of adaptive immune suppression, and higher expression of genes in the 
innate immune response and cancer cell signalling classes, which were negatively 
associated with survival. Only MALT] was not differentially expressed between 
the two groups (P=0.7). 
Mice. Pik3cg-'~ and Pik3cg-'~,PyMT mice were generated as previously 
described'3. Cd8~'~ and Cd4~/~ mice with a C57B1/6J background were purchased 
from the Jackson Laboratory and crossed with syngeneic Pik3cg‘~ mice. All 
animal experiments were performed with approval from the Institutional Animal 
Care and Use Committee of the University of California. Animals were euthanized 
before the IACUC maximum allowable tumour burden of 2 cm? per mouse was 
exceeded. 
Tumour studies. Wild-type or Pik3cg~/~ 6-8 week-old female or male syngeneic 
C57BI/6J (LLC lung, PyMT breast and MEER HPV* HNSCC) or C3He/J (SSCVIL 
HPV~ HNSCC) mice were implanted with 10° tumour cells by subcutaneous 
injection (LLC, MEER, SCCVII) or by orthotopic injection (PyMT) (n= 10-15) 
and tumour growth was monitored for up to 30 days. Tumour dimensions were 
measured once when tumours were palpable. Tumour volumes were calculated 
using the equation (? x w)/2. In some studies, wild-type and Pik3cg~/~ mice 
with LLC tumours were treated with gemcitabine (150 mg kg”) or saline by 
intraperitoneal (i.p.) injection on day 7 and day 14 (n= 10). LLC were acquired 
from ATCC, PyMT were from L. Ellies (University of California), HPV MEER 
were from J. Lee (Cancer Biology Research Center, Sanford Research/USD) 
and SCCVII squamous carcinoma cells were from S. Schoenberger (La Jolla 
Institute for Allergy and Immunology). All cell lines were tested for mycoplasma 
and mouse pathogens and checked for authenticity against the International 
Cell Line Authentication Committee (ICLAC; http://iclac.org/databases/cross- 
contaminations/) list. 

In some studies, mice bearing LLC, PyYMT, HPV* MEER or HPV” HNSCC 
tumour cells were treated once daily by oral gavage with vehicle (5% NMP and 
95% PEG 400), 15mg kg"! per day of the PI3Ky inhibitor IPI-549 or by i-p. 
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injection with 2.5mg kg“! twice per day of TG100-115 (ref. 13) beginning on 
day 8 post-tumour injection and continuing daily until euthanasia. IPI-549 is an 
orally bioavailable PI3Ky inhibitor with a long plasma half-life and a Kp value 
of 0.29nM for PI3Ky with >58-fold weaker binding affinity for the other class I 
PI3K isoforms’”. Enzymatic and cellular assays confirmed the selectivity of IPI- 
549 for PI3Ky (>200-fold in enzymatic assays and >140-fold in cellular assays 
over other class I PI3K isoforms!”). To study the effect of IPI-549 on lung tumour 
growth, LLC tumour cells were passaged three times in C57BL/6 albino male mice. 
When tumour volume reached 1,500 mm?, tumours were collected and single-cell 
suspensions were prepared. This tumour cell suspension was implanted subcu- 
taneously in the hind flank of C57BL/6 albino male mice at 10° cells per mouse. 
Prior to initiating treatment with once daily IPI-549 (15mg kg! orally), groups 
were normalized on the basis of tumour volume. In some studies, wild-type- and 
Pik3cg~/~-tumour-bearing mice were treated with 100 1g of anti-CD8 (clone YTS 
169.4) or an isotype-control clone (LTF-2) from Bio X Cell administered by i-p. 
injections on day 7, 10 and 13 of tumour growth. For all tumour experiments, 
tumour volumes and weights were recorded at death. 

Anti-PD-1 tumour studies. C57B1/6J (wild-type) or Pik3cg! ~ 6-8 week-old male 
or female mice (MEER HPV* HNSCC) or C3He/J (SCCVII HPV~ HNSCC) were 
implanted with tumour cells by subcutaneous injection (10° MEER or 10° SCCVII). 
In HPV* MEER studies, wild-type and Pik3cg! ~ mice were treated with four 
doses of 250 1g of anti-PD-1 antibody (clone RMP-14, Bio X Cell) or rat IgG2a 
isotype control (clone 2A3, Bio X Cell) every 3 days, starting when tumours became 
palpable on day 11 (n= 12-14 mice per group). Wild-type mice bearing HPV* 
tumours were also treated with the PI3Ky inhibitor TG100-115 (ref. 13) twice per 
day by i.p. injection, beginning on day 11. Tumour regressions were calculated 
as a percentage of the difference in tumour volume between the date treatment 
was initiated and the first date of death of the control group. For HPV” SCCVII 
studies, C3He/J mice were treated with PI3Ky inhibitor (2.5 mg kg~’ TG100-115 
ip.) beginning on day 6 post-tumour inoculation and with six doses of anti-PD-1 
antibody (250 jg clone RMP- 14, Bio X Cell) or rat IgG2a isotype control (clone 
2A3, Bio X Cell) every 3 days beginning on day 3 (n= 12 mice per group) or with 
a combination of the two. Alternatively, mice were treated with 5mg kg~' TG100- 
115 twice per day + anti-PD-1 (250,1g every 3 days) beginning on day 1 (Fig. 4). 
Mice that completely cleared HPV* MEER tumours were re-injected with HPV* 
tumour cells contralateral to the initial tumour injection and tumour growth was 
monitored. 

PyMT models of mammary carcinoma. The growth and metastasis of spon- 
taneous mammary tumours in female PyMT* (n= 13) and Pik3cg~/",PyMT* 
(n=8) mice was evaluated over the course of 0-15 weeks. Total tumour burden was 
determined by subtracting the total mammary gland mass in PYMT™~ mice from 
the total mammary gland mass in PyYMT* mice. Lung metastases were quantified 
macroscopically and microscopically in H&E tissue sections at week 15. 
LPS-induced septic shock. Septic shock was induced in wild-type and Pik3cg-/~ 
mice via ip. injection of 25 mg kg! LPS (Sigma, B5:005). Survival was monitored 
every 12h and liver, bone marrow and serum were collected 24h after LPS 
injection. 

Macrophage depletion studies. C57B1/6J female mice were implanted with 10° 
LLC tumour cells by subcutaneous injection. When the average tumour size was 
250 mm’, mice were treated by i-p. injection with 1 mg per mouse clodronate or 
control liposomes (www.clodronateliposomes.com) every 4 days for 2 weeks in 
combination with daily administration of vehicle or IPI-549 (15 mg kg”! per day 
orally). In other studies, 6-week-old female BALB/c mice were injected subcuta- 
neously with 2.5 x 10° CT26 mouse colon carcinoma cells in 100,11 phosphate 
buffered saline (PBS) in the right flank. Eight days later, tumour-bearing mice were 
arranged into four groups (n= 15) with an average tumour volume of 70mm”. Oral 
administration of IPI-549 (15mg kg~!) or vehicle (5% NMP and 95% PEG 400) 
and anti-CSF-1R antibody (50mg kg! i.p. 3x per week, clone AFS98, Bio X Cell) 
began on day 8 after tumour injection via oral gavage at a 5 ml kg! dose volume 
and continued daily for a total of 18 doses. 

Tumour-infiltrating myeloid cell analysis. Six-week-old female BALC/c mice 
were injected subcutaneously with 2.5 x 10° CT26 mouse colon carcinoma cells 
in 100,11 PBS in the right flank. On day 8 after tumour injection, tumour-bearing 
mice were grouped and treated daily with IPI-549 (15 mg kg“, orally) or vehicle 
(5%NMP and 95% PEG 400). In addition, mice were injected i.p. with 50 mg kg“! 
anti-CD115 (Bio X Cell clone AFS98) or 50mg kg”! rat IgG2a isotype control 
(Bio X Cell clone 2A3) antibodies as described above for a total of three injections. 
Two days after the final injection mice were euthanized, tumours were digested in 
a mixture of 0.5 mg ml ! collagenase IV and 150 U ml! DNase I in RPMI-1640 
for 30 min at 37°C and tumour-infiltrating myeloid cells were analysed by flow 
cytometry. 

In vivo macrophage adoptive transfer experiments. CD11b*Gr1~ cells were 
isolated from single-cell suspensions of LLC tumours from donor mice by 
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fluorescence-activated cell sorting (FACS) or serial magnetic bead isolation. 
Additionally, for some experiments, primary bone-marrow-derived macrophages 
were polarized and collected into a single-cell suspension. Purified cells were 
admixed 1:1 with LLC tumour cells and 5 x 105 total cells were injected subcuta- 
neously into new host mice. Tumour dimensions were measured three times per 
week beginning on day 7. In antibody blocking studies, CD11b*Gr1~ cells were 
incubated with 51g anti-IL12 (clone RD1-5D9) or isotype (clone LTF-2, Bio X 
Cell) for 30 min before the addition of tumour cells. Mice were additionally treated 
intradermally with 51g of antibody 3 and 6 days after tumour cell inoculation. In 
some studies, CD11b*Gr1~ cells were pre-incubated with inhibitors of arginase 
(nor-NOHA, 501M, Cayman Chemical), iNOS (1400W dihydrocholoride, 100M, 
Tocris), mTOR (rapamycin, 10|1M Calbiochem), or Ik KS (ML120B, 301M, Tocris) 
for 30 min before the addition of tumour cells. Inoculated mice were further treated 
by intradermal injection with inhibitors at 3 and 6 days after inoculation. 

T cell adoptive transfer. Donor C57B1/6J (WT) or Pik3cg! ~ mice were implanted 
with 10° LLC tumour cells by subcutaneous injection. On day 14 after tumour 
implantation, CD90.2*, CD4* or CD8* cells were harvested by magnetic bead 
isolation (Miltenyi Biotec). T cells were mixed 1:1 with viable LLC tumour cells. 
Cell mixtures containing 5 x 10° total cells were injected into the flanks of naive 
wild-type or Pik3cg~/~ mice (n= 8-10 per group). Tumour growth, intratumoral 
apoptosis and necrosis were investigated over 0-16 days. In other studies, wild- 
type T cells were incubated at 37°C and 5% CO) for 6h with 10 or 100nM IPI- 
549 (Infinity Pharmaceuticals) or Cal-101 (Selleck Chem). After 6h, T cells were 
washed, admixed 1:1 with LLC tumour cells, and 10° total cells were injected 
subcutaneously into recipient mice. Tumour growth was monitored for 14 days. 
Isolation of single cells from mouse tumours. Tumours were isolated, minced 
in a Petri dish on ice and then enzymatically dissociated in Hanks balanced salt 
solution containing 0.5mg ml"! collagenase IV (Sigma), 0.1 mg ml“! hyaluroni- 
dase V (Sigma), 0.6 U ml! dispase II (Roche) and 0.005 MU ml! DNase I (Sigma) 
at 37°C for 5-30 min. The duration of enzymatic treatment was optimized for 
greatest yield of live CD11b* cells per tumour type. Cell suspensions were filtered 
through a 70-1m cell strainer. Red blood cells were solubilized with red cell lysis 
buffer (Pharm Lyse, BD Biosciences) and the resulting suspension was filtered 
through a cell strainer to produce a single-cell suspension. Cells were washed once 
with PBS before use in flow cytometry analysis or magnetic bead purification. 
Peritoneal macrophage isolation. Thioglycollate-elicited peritoneal macrophages 
were collected 96h after ip. injection of a 3% thioglycollate solution. Cells were 
collected from the peritoneal cavity in 10 ml of PBS and macrophage enrichment 
was performed by plating cells in RPMI with 10% FBS and 1% penicillin/ 
streptomycin for 2h at 37°C and 5% CO . After 2h, non-adherent cells were 
removed with three PBS washes, and cells were analysed via flow cytometry and 
qPCR analysis. 

Flow cytometry staining and analysis. Single-cell suspensions (10° cells in 100 ji 
total volume) were incubated with aqua live dead fixable stain (Life Technologies), 
FcR-blocking reagent (BD Biosciences) and fluorescently labelled antibodies and 
incubated at 4°C for 1h. Primary antibodies to cell surface markers directed 
against F4/80 (BM8), CD45 (30-F11), CD11b (M1/70), Grl (RB6-8C5), CD3 (145- 
2C11), CD4 (GK1.5), CD8 (53-6.7), CD273 (B7-DC), CD274 (B7-H1) were from 
eBioscience; Ly6C (AL-21), Ly6G (1A8), CD11c (HL3), and MHC-II (AF6-120.1) 
from BD Pharmingen, CCR2 (475301) from R&D Systems and CD206 (MR5D3) 
from AbD Serotech. For intracellular staining, cells were fixed, permeabilized using 
transcription factor staining buffer set (eBioscience) and then incubated with 
fluorescently labelled antibodies to FoxP3 (FJK-16s) from eBioscience. Multicolour 
FACS analysis was performed on a BD Canto RUO 11 colour analyser. All data 
analysis was performed using the flow cytometry analysis program FloJo (Treestar). 
Magnetic bead purification of myeloid cells. Single-cell preparations from bone 
marrow or tumours were incubated with FcR-blocking reagent (BD Biosciences) 
and then with 201] magnetic microbeads conjugated to antibodies against CD11b, 
Gr1, CD90.2, CD4 and CD8 (Miltenyi Biotech MACS Microbeads) per 10’ cells 
for 20 min at 4°C. Cells bound to magnetic beads were then removed from the cell 
suspension according to the manufacturer's instructions. 

Flow cytometric sorting of cells from tumours and bone marrow. For cell 
sorting, single-cell suspensions were stained with aqua live dead fixable stain (Life 
Technologies) to exclude dead cells and anti-CD11b-APC (M1/70, eBioscience) 
and anti-Grl-FITC (RB6-8C5, eBioscience) antibodies. FACS sorting was 
performed on a FACS Aria 11 colour high speed sorter at the Flow Cytometry 
Core at the UC San Diego Center for AIDS Research. Live cells were sorted into 
the following populations: CD11b*Gr1~, CD1 1b+Gr1, CD11b+Gr1" and 
CD11b~Grl~ cells. CD11b-positive cells were defined by increased staining over 
the isotype control, and Grl levels were defined both by comparison to the isotype 
control and relative staining to other populations. 

Mouse macrophage differentiation and culture. Bone-marrow-derived cells 
were aseptically collected from 6-8 week-old female mice by flushing leg bones 


of euthanized mice with PBS, 0.5% BSA, 2mM EDTA, incubating in red cell lysis 
buffer (155mM NH,Cl, 10 mM NaHCO; and 0.1mM EDTA) and centrifuging 
over Histopaque 1083 to purify the mononuclear cells. Approximately 5 x 107 
bone-marrow-derived cells were purified by gradient centrifugation from the 
femurs and tibias of a single mouse. Purified mononuclear cells were cultured in 
RPMI + 20% serum + 50ng ml! mCSF (PeproTech). 

Human macrophage differentiation and culture. Human leukocytes from 
apheresis blood products were obtained from the San Diego Blood Bank. Cells were 
diluted in PBS, 0.5% BSA, 2mM EDTA, incubated in red cell lysis buffer (155 mM 
NH4,Cl, 10 mM NaHCO; and 0.1mM EDTA) and centrifuged over Histopaque 
1077 to purify mononuclear cells. Approximately 10° bone-marrow-derived cells 
were purified by gradient centrifugation from one apheresis sample. Purified 
mononuclear cells were cultured in RPMI + 20% serum + 50ng ml~! Human 
mCSF (PeproTech). Non-adherent cells were removed after 2h by washing and 
adherent cells were cultured for 6 days to differentiate macrophages fully. 
Macrophage polarization. Bone-marrow-derived macrophages were polarized 
with IFN+ (20ng ml~!, Peprotech) + LPS (100 ng ml !, Sigma) or LPS alone 
for 24h, or IL4 (20ng ml~1, Peprotech) for 24—-48h. For inhibitor studies, PI3K-y 
inhibitors (11M) (IPI-549, Infinity Pharmaceuticals and TG100-115, Targegen/ 
Sanofi-Aventis), rapamycin (10 1M, Selleck), or ML120B (301M) were incubated 
with macrophages 1h before the addition of polarizing stimuli. Total RNA was 
harvested from macrophages using the RNeasy Mini Kit (Qiagen) according to 
the manufacturer's instructions. 

RNA sequencing. Freshly isolated mouse bone marrow cells from nine wild- 
type and nine Pik3cg /~ mice were pooled into three replicates sets of wild-type 
or Pik3cg ~/~ cells and differentiated into macrophages for 6 days in RPMI + 
20% FBS+ 1% penicillin/streptomycin + 50ng ml! mCSF. Each replicate set 
of macrophages was then treated with mCSF, IL4 or IFNy/LPS. Macrophages 
were removed from dishes, and RNA was collected using Qiagen Allprep kit. In 
addition, RNA was harvested from day 14 (500mm*) LLC tumours or purified 
CD11b*Gr1-F480* TAMs from wild-type (C57BL/6) and Pik3cg~/~ mice. RNA 
was collected using the Qiagen Allprep kit. RNA libraries were prepared from 
1jug RNA per sample for sequencing using standard Illumina protocols. RNA 
sequencing was performed by the University of California, San Diego Institute for 
Genomic Medicine. mRNA profiles were generated by single read deep sequencing, 
in triplicate, using Illumina HiSeq2000. 

Sequence analysis. Sequence analysis was performed as previously described'®. 
Sequence files from Illumina HiSeq that passed quality filters were aligned to the 
mouse transcriptome (mm9 genome build) using the Bowtie2 aligner4. Gene- 
level count summaries were analysed for statistically significant changes using 
DESeq. Individual P values were adjusted for multiple testing by calculating 
Storey’s q values using fdrtooltrimmer. For each gene, the q value is the smallest 
false discovery rate at which the gene is found significant. We analysed biological 
processes as defined by the Gene Ontology Consortium. Each gene ontology term 
defines a set of genes. The entire list of genes, sorted by the q value in ascending 
order, was subjected to a non-parametric variant of the gene set enrichment 
analysis (GSEA), in which the parametric Kolmogorov—Smirnov P value was 
replaced with the exact rank-order P value. We perform a Bonferroni adjustment 
of gene set P values for the number of gene sets tested. Heat maps of expression 
levels were created using in-house hierarchical clustering software that implements 
Ward clustering. The colours qualitatively correspond to fold changes. 
Individual quantitative RT-PCR. cDNA was prepared using 1 jig RNA with the 
qScript cDNA Synthesis Kit (Quanta Biosciences). Sybr green-based qPCR was 
performed using human and mouse primers to Arg, Ifng, 1/10, I112p40, II1b, II6, 
Ccl2, Vegfa, Gapdh, Nos2, Tgfb1, Tnfa and mouse H2-Aa, H2-Ab1, H2-Eb1, and 
H60a (Qiagen QuantiTect Primer Assay). mRNA levels were normalized to Gapdh 


(AC, = Ctgeneofinterest ~ Ctcapan) 2nd reported as relative mRNA expression 


AAC, = y-(ac tsample AC icons) or fold change. 


siRNA-mediated knockdown and gene transfection. Freshly isolated bone- 
marrow-derived CD11b* myeloid cells or differentiated macrophages were trans- 
fected by electroporation using an AMAXA mouse macrophage nucleofection 
kit with 100 nM of siRNA or 21g Pik3cgCAAX or pcDNA control plasmid. Non- 
silencing (Ctrl_AllStars_1) siRNA and Cebpb (MmCebpb_4 and MmCebpb_6), 
and Mtor (Mm_Frap1_1 and Mm_Frap1_2) siRNAs were purchased from Qiagen. 
After transfection, cells were cultured for 36-48 h in RPMI containing 10% serum 
and 10 ng ml“! mCSF (PeproTech) or polarized as described above. 

ELISA assays. Whole tumours, CD11b*Gr1~ cells, CD90.2* cells, CD4* cells and 
CD8* cells isolated from LLC tumours were lysed in RIPA buffer and total protein 
concentrations were determined using a BCA protein assay (Pierce). Macrophage 
supernatants (10011) or 500,1g of total protein lysate from tumours were used in 
ELISAs to detect CCL2, TGF@, IL18, TNFa, IL6, IFNy, IL10, IL12 and granzyme B 
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(ready set go ELISA, eBioscience). Protein expression was normalized to total 
volume (supernatants) or mg total protein (tumour lysates). 

Quantitative colourimetric arginase determination. The QuantiChrom arginase 
assay kit (DARG-200, BioAssay Systems) was used to measure arginase activity in 
primary mouse bone-marrow-derived macrophages from wild-type and Pik3cg /~ 
mice according to the manufacturer’s instructions. For all conditions, cells were 
harvested and lysed in 10 mM Tris (pH 7.4) containing 11M pepstatin A, 11M 
leupeptin, and 0.4% (w/v) Triton X-100. Samples were centrifuged at 20,000g at 
4°C for 10 min. 

Transcription factor assays. To measure NF«KB and C/EBP activation, TransAM 
NF«B family and C/EBP transcription factor assay kits (43296 and 44196, Active 
Motif) were used according to the manufacturer’s protocol. Briefly, wild-type 
and Pik3cg~/~ bone-marrow-derived macrophages were stimulated with LPS 
(100ng ml!) or IL4 (20 ng ml!) and nuclear extracts were prepared in lysis 
buffer AM2 (Active Motif). Nuclear extracts were incubated with the immobilized 
consensus sequences and RelA, cRel or C/EBP$ were detected using specific 
primary antibodies. Quantification was performed via colourimetric readout of 
absorbance at 450 nm. 

Immunoblotting. IL4 and LPS macrophage cultures were solubilized in RIPA 
buffer containing protease and phosphatase inhibitors. Thirty micrograms of 
protein was electrophorezed on Biorad precast gradient gels and electroblotted 
onto PVDF membranes. Proteins were detected by incubation with 1:1,000 
dilutions of primary antibodies, washed and incubated with goat anti-rabbit- 
HRP antibodies and detected after incubation with a chemiluminescent substrate. 
Primary antibodies directed against Akt (11E7), p-Akt (244F9), IkBa (L35A5), 
IkK® (D30C6), p-IkKa/B (16A6), RelA (D14E12), pRelA (93H1), C/EBP3 (#3087), 
p-CEBP8 (#3082), IRAK1 (D51G7), TBK1 (D1B4) and PI3K~y (#4252) were from 
Cell Signaling Technology and pTBK1 (EPR2867(2)) was from Abcam. 

In vitro cytotoxicity assay. CD90.2* tumour-derived T cells were purified from 
LLC tumour-bearing wild-type and Pik3cg~/~ or TG100-115 and control treated 
mice and then co-incubated with LLC tumour cells (target cells) at 2.5:1, 5:1 and 
10:1 ratios of T cells to tumour cells (2 x 10° LLC tumour cells per well) for 6h. 
Target cell killing was assayed by collecting the supernatants from each well for 
measurement of the lactate dehydrogenase release (Cytotox96 non-radioactive 
cytotoxicity assay kit, Promega). 

Immunohistochemistry. Tumour samples were collected and cryopreserved in 
OCT, sections (5 um) were fixed in 100% cold acetone, blocked with 8% normal 
goat serum for 2h, and incubated anti-CD8 (53-6.7, 1:50 BD Biosciences) for 2h 
at room temperature. Sections were washed three times with PBS and incubated 
with Alexa594-conjugated secondary antibodies. Slides were counterstained with 
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4',6-diamidino-2-phenylindole (DAPI) to identify nuclei. Immunofluorescence 
images were collected on a Nikon microscope (Eclipse TE2000-U) and analysed 
using Metamorph image capture and analysis software (Version 6.3r5, Molecular 
Devices). The detection of apoptotic cells was performed using a TUNEL-assay 
(ApopTag fluorescein in situ apoptosis detection kit, Promega) according to 
the manufacturer’s instructions. Slides were washed and mounted in DAKO 
fluorescent mounting medium. Immunofluorescence images were collected on a 
Nikon microscope (Eclipse TE2000-U) and analysed with MetaMorph software 
(version 6.3r5) or SPOT software (version 4.6). Pixels per field or cell number per 
field were quantified in five 100 x fields from ten biological replicates. 

Statistics. Primary tumour samples with mRNA expression data were scored as 
above or below the median expression level, and tested for association with patient 
survival using a log-rank test at 5% significance. For studies evaluating the effect of 
drugs on tumour size, tumour dimensions were measured directly before the start 
of treatment, tumour volumes were computed and mice were randomly assigned 
to groups so that the mean volume + s.e.m. of each group was identical. A sample 
size of ten mice per group provided 80% power to detect a mean difference of 2.25 
standard deviation (s.d.) between two groups (based on a two-sample f-test with 
two-sided 5% significance level). Sample sizes of 15 mice per group provided 80% 
power to detect one s.d. difference between two groups. Data were normalized 
to the standard (control). Analysis for significance was performed by one-way 
ANOVA with a Tukey’s post-hoc test for multiple pairwise testing with more than 
two groups and by parametric or nonparametric Student's t-test when only two 
groups were compared. We used a two-sample t-test (two groups) and ANOVA 
(multiple groups) when data were normally distributed and a Wilcoxon rank sum 
test (two groups) when data were not normally distributed. All mouse studies 
were randomized and blinded; assignment of mice to treatment groups, tumour 
measurement and tumour analysis was performed by coding mice with randomly 
assigned mouse number, with the key unknown to operators until experiments 
were completed. In tumour studies for which tumour size was the outcome, mice 
removed from the study owing to health concerns were not included in endpoint 
analyses. All experiments were performed at least twice; n refers to biological 
replicates. 

Data availability. RNA sequencing data can be accessed using numbers GSE58318 
(in vitro macrophage samples) and GSE84318 (in vivo tumour and tumour- 
associated macrophages samples) at www.ncbi.nlm.nih.gov/geo. 


29. Gyérffy, B., Surowiak, P., Budczies, J. & Lanczky, A. Online survival analysis 
software to assess the prognostic value of biomarkers using transcriptomic 
data in non-small-cell lung cancer. PLoS One 8, e82241 (2013). 
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Extended Data Figure 1 | Pro-inflammatory gene expression signatures 
predict survival in cancer patients. a—e, Association of IL12A (P=0.026), 
IL12B (P=0.039), IENG (P = 0.002), CD8A (P= 0.001) and IL6 
(P=0.001) with survival in 97 HPV* HNSCC patients (log-rank test). 

f, Multivariate immune signature for 720 lung adenocarcinoma samples 
from patients from KM plotter cohorts (P= 0.001; log-rank test). 

g, Multivariate immune signature in 876 gastric cancer samples from 
patients from KM plotter cohorts (P= 0.001; log-rank test). h, Western 
blotting of PI3Ky in B cells, T cells, macrophages (M®) and LLC, PyMT 
and MEER tumour cells. i, Kaplan-Meier survival plot of wild-type 

(WT) and Pik3cg"'~ mice inoculated with LPS (endotoxin). P=0.05, 
log-rank test. j, Pro-inflammatory cytokine mRNA expression in bone 
marrow from wild-type and Pik3cg'~ LPS-injected mice. n = 4 biological 
replicates; **P < 0.001; ***P < 0.0001; one-sided ANOVA with Tukey’s 
post-hoc test. k, Circulating inflammatory cytokine levels in Pik3cg-/~ 


and wild-type mice 24 h after endotoxin administration. n = 4 biological 
replicates; *P < 0.01; **P < 0.001; one-sided ANOVA with Tukey’s post- 
hoc test. 1, Tumour volume of implanted HPV~ (SCCVII) carcinomas 
(n= 15 biological replicates) from vehicle or PI3K4-inhibitor-treated 
mice. Arrow, start of drug treatment; P= 0.001; t-test. m, Dose-response 
of the effect of PI3Ky inhibitor IPI549 on in vitro MEER cell viability. 

n, Spontaneous PyMT lung metastases per high-power field (200 x) in 
wild-type and Pik3cg'~ mice. n=8 biological replicates; P= 0.007; t-test. 
o, Kaplan-Meier survival plot of mice bearing orthotopic PyYMT tumours 
treated with vehicle or the PI3K7 inhibitor IPI549 initiated as indicated 
by the arrow (n = 10). p, In vitro LLC tumour cell survival in the presence 
of gemcitabine. q, Volume of LLC tumours implanted in wild-type and 
Pik3cg-'~ mice treated with saline or gemcitabine. n = 10 biological 
replicates; **P < 0.001; ***P < 0.0001. All experiments were performed 
two or more times. j-l, m, q, Data are shown as mean + s.e.m. 
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Extended Data Figure 2 | Effect of PI3K inhibition on tumour 
inflammation. a, Gating strategy for flow cytometry analysis of myeloid 
cell populations in peripheral blood leukocytes. b, Representative flow 
cytometry analysis and quantification of myeloid cell populations in 
peripheral blood (PB) of naive and LLC tumour-bearing mice. n =3 
biological replicates; P < 0.008; t-test. c, Flow cytometry analysis of 
myeloid cell populations on days 0, 7, 14 and 21 after subcutaneous 
inoculation with Lewis lung carcinoma cells (n = 3 biological replicates). 
d, Quantification of populations from c. e, Flow cytometry analysis of 
Ly6G, CCR2, CX3CR1, CD206, CD11c, F4/80 and CD45 expression in 
myeloid cell populations from ¢ (n= 3 biological replicates). f, Relative 
immune response transcript levels + 


PyMT 


BS 
a 


MEER 
N 
3 


% live cells 


° 


li Grt . 
+Gri- ® Nai 
Len Ge 
i NaivePB Tumor PB 4 
R1R2R3 o 
: 8 
g 
= 9 
R1 R2~ R3 
Gri- = Grtlo Grthi 
21 d 
25 
o 
3 re -&-CD11bGr1- (R1) 
is —#-CD11bGrtlo (R2) 
2 10 “@-CD11bGrthi (R3) 
x5 
— 0 
0 7 14 21 28 
Days 
D206 CD11¢ F4/80 CD45 
é 3 1 
1 - * ' -&-CD11bGr1- (R1) 
a : -#-CD11bGr1lo (R2) 
7 t ~® CD11bGrthi (R3) 
i 


Tgfb1 


14 21 28 1421 0 7 14 21 28 0 7 14 
. J 
PH0.008 prtoy=/- Control PISKyi 
e E- 
= 
= a 
=. 
a 
i 
a Ww 
wW = 
= 
| | 5 5 
Oo (o} 
oO oO 
a n n 


Grilo — Grthi 


Gr1 


+ s.e.m. in tumour-derived myeloid 


% live cells 
N 
oOo 


“& CD11b+Gr1- 


4 “® CD11b+Gr1lo 
] o& CD11b+Grthi 
%© CD11b- Gr1- 


0 7 14 21 28 


21 28 
60 = vehicle 
" PI3Kyi 
40 
20 


Gri- = Grilo— Grthi 


cells and tumour cells (CD11b Gr1- cells) isolated at day (d)0 (n = 3), 

d7 (n=5), d14 (n= 3) or d21 (n=4) after LLC cell inoculation (P < 0.002, 
d21 versus d0). n, biological replicates. g Flow cytometry analysis of 
CD11b* myeloid cell populations in wild-type and Pik3cg-/~ LLC, PyMT 
and MEER tumours (n = 3 biological replicates). h, Quantification of 
CD11b* myeloid cell populations (P= 0.001; t-test) from g. i, Flow 
cytometry analysis of CD11b* myeloid cell populations in vehicle 

and PI3K~-inhibitor-treated PyMT, MEER and SCCVII tumours 

(n= 3 biological replicates). j, Quantification of CD11b* myeloid cell 
populations from i. All experiments were performed two or more times. 
b, d, h, j, Data are shown as mean + s.e.m. 
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Extended Data Figure 3 | Effect of PI3K¥ inhibition on TAM expression — and CD11b*Gr1"™ (granulocyte) myeloid cells isolated from LLC tumours 


profile. a, Heat map of differentially expressed immune response genes grown in Pik3cg~'~ mice (n=5 biological replicates) and normalized to 

in TAMs isolated from LLC tumours from wild-type and Pik3cg~'~ mice wild-type control. n =5 biological replicates; P= 0.001; one-sided ANOVA 
(n=3 biological replicates; local false discovery rate < 0.1) obtained by with Tukey’s post-hoc test. e, Arginase activity in tumours and TAMs 

RNA sequencing. b, Relative mRNA expression of immune response isolated from LLC tumours grown in wild-type and Pik3cg-/~ mice. n=4 
factors in HPV* HNSCC MEER tumours from Pik3cg-/~ and wild-type biological replicates; ***P < 0.0003; t-test. f, Protein concentration of 
mice (n= 4 biological replicates), *P=0.01; t-test. c, Relative mRNA cytokines in LLC tumours and TAMs from wild-type and Pik3cg~/~ mice. 
expression of immune response factors in CD11b* myeloid cells isolated n= 4 biological replicates; *P < 0.01; **P < 0.001; ***P < 0.0001; t-test. 
from PyMT tumours grown in vehicle or PI3Ky-inhibitor-treated mice All experiments were performed two or more times. b-f, Data are shown 
(n=4 biological replicates), *P=0.01; t-test. d, Fold change in mRNA as mean +s.e.m. 
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Extended Data Figure 4 | See next page for caption. 
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Extended Data Figure 4 | Effect of PI3K~ deletion on in vitro 
macrophage mRNA expression. a, Relative immune response mRNA 
expression in Pik3cg-'~ and wild-type (WT) mouse macrophages 
stimulated by IL4- or LLC-tumour-cell-conditioned medium as 
determined by RT-PCR. Data are shown as mean +s.e.m.; n= 3 biological 
replicates; *P = 0.01; t-test. b, Heat map of differentially expressed 
immune response transcripts in IL4- and IFNy/LPS-polarized mouse 
macrophages obtained by RNA sequencing. n = 3 biological replicates; 
P=0.00001. c, Heat map of select differentially expressed immune 
response transcripts in in vitro polarized mouse macrophages. n = 3 
biological replicates; P=0.00001. d, Heat map of immune response 
transcripts in mCSF-, IL4- and IFNy/LPS-stimulated Pik3cg™! ~ mouse 


macrophages obtained by RNA sequencing and normalized to 
wild-type macrophages. n = 3 biological replicates; P= 0.00001. e, Heat 
map of select differentially expressed immune response transcripts in 
polarized Pik3cg-'~ mouse macrophages normalized to wild-type. n =3 
biological replicates; P= 0.00001. f, Heat map of differentially expressed 
antigen presentation and processing mRNAs in mCSF, IL4 and 
IFN+/LPS-polarized Pik3cg-'~ mouse macrophages. n = 3 biological 
replicates; P=0.00001. g, Heat map of differentially expressed 
chemokine and chemokine receptor mRNAs in polarized Pik3cg/~ 
mouse macrophages. n = 3 biological replicates; P= 0.00001. All 
experiments were performed two or more times. 
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Extended Data Figure 5 | Effect of PI3K~ inhibition on mouse and 
human macrophage polarization. a, b, Relative mRNA expression of 
immune response transcripts in IL4 and IFNy/LPS-stimulated vehicle and 
PI3K~ inhibitor (IPI-549)-treated mouse (a) and human (b) macrophages. 
n= 3 biological replicates; *P = 0.01; **P=0.001; t-test. c, Relative mRNA 
expression of M2 macrophage markers (Arg1, Retnla (also known as Fizz1) 
and Chil3 (also known as Ym1) in wild-type and Pik3cg~'~ IL4-stimulated 
macrophages (n = 3 biological replicates; P = 0.05; t-test). d, Relative RNA 
expression of MHC family members in wild-type and Pik3cg~'~ IL4- 
stimulated macrophages. n = 3 biological replicates; P= 0.01; t-test. 
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e, f, mRNA expression of cytokines over time in IFN/LPS, LPS and IL4 
stimulated wild-type or Pik3cg~/~ (e) and vehicle- or PI3K-inhibitor- 
(IPI-549)-treated (f) macrophages (n= 3 biological replicates). g, Relative 
mRNA expression in mCSF-stimulated wild-type or Pik3cg~/~ and 
IPI-549- or vehicle-treated macrophages. n = 3 biological replicates; 
P=0.01; t-test. h, Relative nuclear RelA DNA binding activity in 
IFN1/LPS stimulated wild-type and Pik3cg~'~ macrophages. n = 3 
biological replicates; P= 0.01; t-test. All experiments were performed 


two or more times. 
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Extended Data Figure 6 | Mechanism of PI3K~\-mediated gene 
expression regulation. a, Relative levels of phospho/total p65 and 
phospho/total C/EBP@ in LPS- and IL4-stimulated wild-type and 
Pik3cg”! ~ macrophages. b, Immunoblotting to detect pThr308Akt, total 
Akt, phospho-p65 and total p65 in LPS- and IL4- stimulated macrophages 
that were treated with vehicle or the PI3K4 inhibitor IPI-549. c, Relative 
Argl mRNA expression in myeloid cells transfected with constitutively 
active, membrane-targeted PI3Ky (Pik3cgCAAX) and Mtor, S6ka, 

Cebpb or control siRNA. n=3 biological replicates; P= 0.001, t-test. 

d, Validation of mRNA expression in macrophages expressing siRNAs 
from c (n = 3 biological replicates). e, Effect on cytokine mRNA expression 
in wild-type macrophages transfected with Cebpb, Mtor or S6ka siRNA. 


Days 


Days 


n= 3 biological replicates; *P = 0.01; **P=0.001; t-test. f, g, Cytokine 
mRNA expression in macrophages treated with rapamycin (f) or the S6K 
inhibitor PF4708671 (g) (n=3 biological replicates, P= 0.001, t-test). 

h, Immunofluorescence images of CD8* T cells in 101m tumour sections 
from animals in Fig. 3b, c. i, Tumour volumes from tumour cells mixed 
with wild-type TAMs pretreated with the mTOR inhibitor rapamycin or 
the arginase inhibitor nor-NOHA and Pik3cg-/~ TAMs pretreated with 
anti-IL12 or isotype-matched control antibody (cIgG), the IkK8 inhibitor 
MLB120 or the NOS2 inhibitor 1400W dihydrochloride (n = 10 biological 
replicates; P <0.04, one-sided ANOVA with Tukey’s post-hoc test). All 
experiments were performed two or more times. c-g, i, Data are shown as 
mean =r s.e.m. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


m Vehicle Control 
a ; b CD11b+Gr1-F4/80+ = C cptibcrt Faso m Vehicle Clodronate 
= Vane Sale i) Tumor Liver m PI3Kyi + Control 
S 4200.| = PI3Kyi + Control S ol ‘p< 0.02 or “p < 0.005 ® 39 ry ey + Clodronate 
= = PI3Kyi + Clodronate 3S 8 ica} g 
& Vv o1 2 3 + 
2 800 Treatment Q 3 > 2 a 2 
5 Start Es = ° 
3 400 : Bi 5 
£ 0 - & 0 & 
= & PRR FU PI3Kyi:- + = + 
ie) 5 10 15 20 25  Clodronate: - : = Clodronate: - = + 
e s *p<0.05 f *p<0.001 g 
E 2 *p<0.001 
& 8 eae Wt p110y-/- 
o 2a 
5 5S 
$ oa 
5 BO 
E 3s 
2 & 
PIBKyi: - + - 
i : WT p110y-/- 
PISKyi: - + : 7 Anti-CD115: - - + 
Anti-CD115: - - + + 5 Spleen, Normal mice 
h PYMT over *p=0.001  *p=0.04 * p=0.001 ! *p=0.001 wt 
WT PHOysE 104") p110y-/- 
108 2 8 Gwrt 2 
sf us 3 40 2, Bpttoy-/- 3 
| + ro) + 
Qo | 2 04 2 
ae im, fom | 8 20 25 8 
10” a — | = ss x 
Pill a a aT CT 0 0 
cD4..: ——— > CD3+ CD4+ CD8+ 
HPV+ MEER 16 Spleen, Tumor bearing mice 
Control PI3Kyi 
1 2 12 Ocontrol “ 
toy“ HH 8 BIPi-549 3 
3 ¥ « 38 8 
‘om 2 oo + 
A 7 Qa © 2 
aE oe ea S 24 8 
> x RS 
0 1 10 1 01 to 2 0 - 
CD4+ CD8+ 
nA Cbae CD4+ ——sCD8+ 
HPV- (SCCVII) ao 
Control PI3Kyi 25 *p = 0.03 Spleen, Tumor bearing mice 
aa Control 
10° 6 « 60 
40% ee ia BPI3Kyi | 
8 o 4 ° 40 
om -- . $ 
0 4 wa ae ae 22 8 20 
x ° 
TR TO OF oie Io “io 0 - 
CD4+ CD8+ 0 
cp4 Css FoxP3-  FoxP3+ 
J k : 
— WT + saline = 
~ gq 00 )— wr + anti-cos 20 rPsO Ot Bwt 
bY < —p110y-/- + Saline ; P=0.01 mitagy./- 
E E —p110;-/- + anti-CD +46 — 
— = 400 ae 
o o 0512 
5 5 ee 
§ 8 200 89 08 
5 5 2 804 
2 e 6 0 - 
0 5 1 1% 20 25 Fr 0 10 15 20 saline a-CD8 saline o-CD8 
Days Days WT p110 y-/- 
Naive . m ow 
E 0.16 ‘Tumor Bearing @ p110y-/- IFN Granzyme B 
YY 
c 0.25 oWT 
3 @p110y-/- E10 pts a 
B 0.12 02 7 = —S. 1000 
o a 
ae 0.15 28 800 
s 64 3g 6 600 
8 0.04 . = 4 400 
ae 0.05 | 82 200 
0 0 80 0 
Ant-CD3  - + + - + + Anti-CD3 = * e F Basal anti-CD3 + Basal anti-CD3 + 
Anti-CD28 - - + - - + Anti-CD28 - = & eS = anti-CD28 anti-CD28 
IL-2 . fos 4 ae IL GS .o« a ar 


Extended Data Figure 7 | No direct effect of PI3K+ inhibition on T cells. 
a, Volumes of LLC tumours treated with vehicle + control liposomes, 
PI3Ky inhibitor (IPI-549) + control liposomes, clodronate liposomes + 
vehicle and PI3Ky inhibitor + clodronate liposomes. n = 10 biological 
replicates; P= 0.005; one-sided ANOVA with Tukey’s post-hoc test. 

b, Quantification of F4/80* macrophages in tumours from a. n= 3 
biological replicates; P= 0.02; one-sided ANOVA with Tukey’s post- 

hoc test. c, Quantification of F4/80* macrophages in livers from a. 

n=3 biological replicates; P< 0.005; one-sided ANOVA with Tukey’s 
post-hoc test. d, Quantification of T cells in tumours from a. n = 3 
biological replicates; *P < 0.05, one-sided ANOVA with Tukey’s post-hoc 
test. e, Volumes of CT26 tumours treated with vehicle + clgG, PI3Ky 
inhibitor (IPI-549) + clgG, anti-CD115 + vehicle and PI3Ky inhibitor + 
anti-CD115. n= 15 biological replicates; P= 0.05; one-sided ANOVA 
with Tukey’s post-hoc test. f, Quantification of CD11b* myeloid cells in 
tumours from e. n= 5 biological replicates; P< 0.001; one-sided ANOVA 
with Tukey’s post-hoc test. g, Images and quantification of CD8* T cells 
in wild-type and Pik3cg-'~ LLC tumours by immunohistochemistry. n =5 


biological replicates; P= 0.001; one-sided ANOVA with Tukey’s post-hoc 
test. h, Flow cytometry analysis and quantification of T cell populations 
in tumours from wild-type and Pik3cg~/~ or IPI-549-treated mice. (n=3 
biological replicates; P < 0.05; t-test. i, Quantification of T cells in spleens 
of naive and LLC tumour-bearing wild-type and Pik3cg-/~ mice. n=3 
biological replicates; P= 0.001; t-test. j, Volumes of LLC lung tumours 
from wild-type, Pik3cg-'~, CD8~/~ and CD8~'~, Pik3cg~'~ mice. n= 12 
biological replicates; P< 0.001; one-sided ANOVA with Tukey’s post- 
hoc test. k, LLC tumour volume from wild-type and Pik3cg-/~ mice 
treated with anti-CD8 antibodies or control (n= 10 biological replicates; 
P=0.004; one-sided ANOVA with Tukey’s post-hoc test) and per cent 
CD8* T cells out of CD3* T cells in these tumours (n = 3 biological 
replicates; P= 0.01; t-test). 1, In vitro proliferation of T cells (mean +s.e.m. 
absorbance at 560 nm) isolated from naive and LLC tumour-bearing 
wild-type and Pik3cg~'~ mice (n=3 biological replicates). m, IFN-y 

and granzyme B protein expression in T cells from | (n = 3 biological 
replicates). All data are shown as mean + s.e.m. and all experiments were 
performed two or more times. 
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Extended Data Figure 8 | PI3K~ inhibition relieves T cell exhaustion. 
a, Expression of IFNy in activated human T cells treated with PI3Ky 
and PI3K6 inhibitors. Data are shown as mean + s.d.; n =2 biological 
replicates. b, Tumour weights derived from a mixture of LLC cells and 
wild-type or Pik3cg~/~ tumour-derived T cells or wild-type T cells 
pre-incubated with 10 or 100 nM PI3Ky (IPI-549) and PI3K6 (Cal101) 
inhibitors before implantation. n= 16 biological replicates; P= 0.005 
(Pik3cg-'~); P=0.05 (PI3K~ji); one-sided ANOVA with Tukey’s post-hoc 
test. c, d, LLC tumour cell cytotoxicity induced by T cells isolated from 
LLC tumours from wild-type and Pik3cg~/~ (c) or control- and PI3Ky- 
inhibitor-treated (d) mice. n = 3 biological replicates; *P < 0.001; 

t-test. e, Images of TUNEL and haematoxylin and eosin stained tumours 
implanted with WT, Pik3cg™! ~ or no T cells from tumours shown in 

Fig. 3h. f, Quantification of TUNEL” cells in tumour sections from e. 
n= 10 biological replicates; P= 0.01; t-test. g, Tumour volumes in 
wild-type mice of tumours derived from LLC tumour cells mixed 1:1 
with CD90.2*, CD4t and CD8° T cells or no T cells from wild-type or 


Cas Grzb 


Pik3cg-/~ tumour-bearing mice. n= 8 biological replicates; P= 0.001; 

one way ANOVA with Tukey’s post-hoc test. h, mRNA expression of 

IL10 (P= 0.008; t-test) and TGFS (P= 0.03, t-test) protein expression in 
lysates from tumour and CD90.2*, CD8* and CD4* T cells isolated from 
LLC tumours grown in wild-type and Pik3cg”'~ mice (n =3 biological 
replicates). i, IFNy (P=0.13, t-test) and granzyme B (P= 0.004, t-test) 
protein expression in PI3Ky-inhibitor- or control-treated LLC tumours 
(n=3 biological replicates). j, fng and Tgfb1 mRNA expression in 

T cells isolated from LLC tumours grown in wild-type and Pik3cg-/~ or 
control- and PI3K~-inhibitor-treated mice. n = 3 biological replicates; 
P=0.05, t-test). k, Relative mRNA expression of Cd4, Cd8, Gzmb and Ifng 
in control- and PI3K~+-inhibitor-treated PyMT tumours. n = 3 biological 
replicates; P= 0.05, t-test. 1, Relative mRNA expression of Cd4, Cd8, 
Gzmb and Ifng in wild-type and Pik3cg~'~ and PI3Ky-inhibitor-treated 
HPV* MEER tumours (n = 3 biological replicates, t-test). All experiments 
were performed two or more times. b-d, f, g-j, 1, Data are shown as 

mean + s.e.m. 
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Extended Data Figure 9 | PI3K~y role in the macrophage-mediated HNSCC tumour volume in mice that were treated with PI3Ky inhibitor 


tumour immune response. a, b, Flow cytometry analysis of PD-L1 and 
PD-L2 expression in tumour cells and TAMs from wild-type and Pik3cg”/~ 
LLC tumours (a) and wild-type and Pik3cg~'~ in vitro cultured IFN+/LPS~ 
and IL4-stimulated macrophages (b) (= 3 biological replicates). c, HPV~ 
HNSCC tumour volume in female wild-type or Pik3cg~'~ mice that were 
treated with anti-PD-1 or isotype-matched antibody (cIgG), as indicated 
by arrows and per cent change in tumour volumes between days 11 and 23. 
n= 10 biological replicates; *P = 0.01; **P = 0.001; ***P=0.0001; 

**P = (1.00001; one-sided ANOVA with Tukey’s post-hoc test). d, HPV* 
HNSCC tumour volume in female wild-type mice that were treated with 
PI3K~ inhibitor (2.5 mg kg~' TG100-115 twice per day) in combination 
with anti-PD-1 or isotype-matched antibody (clgG), as indicated by 
arrows, and per cent change in tumour volumes between days 11 and 29. 
n= 10 biological replicates; *P=0.01; **P=0.001; ***P=0.0001; 

**** P — (00001, one-sided ANOVA with Tukey’s post-hoc test). e, HPV~ 


(2.5 mg kg~! TG100-115 twice per day) in combination with anti-PD-1 or 
clgG, as indicated by arrows, per cent change in tumour volumes between 
days 19 and 26 and survival of treated mice. n= 10 biological replicates; 
*P=0.01; **P=0.001; ***P = 0.0001; ****P= 0.00001; one-sided 
ANOVA with Tukey’s post-hoc test). f, Tumour volume in HPV* mice 
that had previously cleared HPV* tumours and that were re-challenged 
with new HPV* tumours (n = 7-12 biological replicates) compared to 
wild-type mice newly implanted with HPV* tumours (n= 5 biological 
replicates). ***P = 0.0001; ****P = 0.00001; one-sided ANOVA with 
Tukey’s post-hoc test). g, Per cent CD3*, CD4* and CD8* T cells and 
MHCII* macrophages from Fig. 4i. n = 3 biological replicates; *P = 0.05; 
** P= 0.005; ***P = 0.0005; ****P = 0.00005; one-sided ANOVA with 
Tukey’s post-hoc test. All experiments were performed two or more times. 
c-g, Data are shown as mean + s.e.m. 
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Extended Data Figure 10 | PI3K~ promotes immune suppression. 
a, Comparison of median gene expression between HPV* (n=97) and 
HPV~ (n= 423) cohorts indicating HPV” samples had significantly 
(P <0.05, log-rank test) lower expression of adaptive immune genes and 
higher expression of immune suppressive and/or pro-metastasis genes. 
Blue, HPV~ samples; red, HPV* samples. b, Model depicting the effect 
of PI3K~y inhibition on tumour immune suppression. PI3K7 inhibition 
converts tumour-associated macrophages into pro-inflammatory 
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macrophages that promote a CD8* T cell response that suppresses 
tumour growth. c, Model depicting the PI3Ky signalling pathway in 
macrophages. PI3K7 activation attenuates NF«B activation and promotes 
mTOR-dependent C/EBP8 activation, leading to expression of immune 
suppressive factors and tumour growth. By contrast, PI3Ky inhibition 
inhibits C/EBP® and stimulates NFKB, leading to altered expression of 
pro-inflammatory immune response cytokines. 
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Overcoming resistance to checkpoint blockade 
therapy by targeting PI3K~ in myeloid cells 
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Recent clinical trials using immunotherapy have demonstrated 
its potential to control cancer by disinhibiting the immune 
system. Immune checkpoint blocking (ICB) antibodies against 
cytotoxic-T-lymphocyte-associated protein 4 or programmed 
cell death protein 1/programmed death-ligand 1 have displayed 
durable clinical responses in various cancers!. Although these new 
immunotherapies have had a notable effect on cancer treatment, 
multiple mechanisms of immune resistance exist in tumours. 
Among the key mechanisms, myeloid cells have a major role in 
limiting effective tumour immunity”*. Growing evidence suggests 
that high infiltration of immune-suppressive myeloid cells correlates 
with poor prognosis and ICB resistance*®. These observations 
suggest a need for a precision medicine approach in which the 
design of the immunotherapeutic combination is modified on 
the basis of the tumour immune landscape to overcome such 
resistance mechanisms. Here we employ a pre-clinical mouse model 
system and show that resistance to ICB is directly mediated by the 
suppressive activity of infiltrating myeloid cells in various tumours. 
Furthermore, selective pharmacologic targeting of the gamma 
isoform of phosphoinositide 3-kinase (PI3K~y), highly expressed 
in myeloid cells, restores sensitivity to ICB. We demonstrate that 
targeting PI3K~ with a selective inhibitor, currently being evaluated 
in a phase 1 clinical trial (NCT02637531), can reshape the tumour 
immune microenvironment and promote cytotoxic-T-cell-mediated 
tumour regression without targeting cancer cells directly. Our 
results introduce opportunities for new combination strategies 
using a selective small molecule PI3K~ inhibitor, such as IPI-549, 
to overcome resistance to ICB in patients with high levels of 
suppressive myeloid cell infiltration in tumours. 
Tumoutr-associated myeloid cells (TAMCs) constitute a major com- 
ponent of the tumour microenvironment. Although some controversies 
persist in the precise description of the distinct subsets of this heter- 
ogeneous population, it is accepted that these cells promote tumour 
immunosuppression’. Recent studies support their contribution to 
the suppression of T cell function, which is not abolished by the use 
of ICB®"'. To understand the association between resistance to ICB 
and myeloid cell infiltration, we compared multiple mouse tumour 
models treated with ICB. We show that mice bearing 4T1 breast car- 
cinoma are resistant to anti-PD-1 (programmed cell death protein 1) 
or anti-CTLA4 (cytotoxic-T-lymphocyte-associated protein 4) ther- 
apy (Fig. la and Extended Data Fig. 1). We observe that myeloid cells 
(CD11b*) constitute the majority of CD45* tumour-infiltrating leu- 
kocytes (TILs) in this model (Fig. 1b). This correlates with reduced 
CD8* T cell infiltration and cytolytic function (Fig. 1b, c). By con- 
trast, B16-F10 melanoma tumours, which are more responsive to ICB 
(Fig. la and Extended Data Fig. 1), exhibit less myeloid cell infiltration 


but contain more activated CD8* T cells (Fig. 1b, c). Additionally, 
CD8* T cells express more granzyme B in the B16-F10 model. They 
also express higher levels of PD-1 and CTLA4 (Fig. 1c, data not shown), 
which might explain their sensitivity to ICB. Furthermore, myeloid 
cells from 4T1 tumours or spleens suppress proliferation of T cells 
to a greater extent compared to myeloid cells from B16-F10 models 
(Fig. 1d and Extended Data Fig. 1b). These data suggest that TAMCs 
have varying phenotypes and are more suppressive in ICB-resistant 
tumours. Tumour-derived soluble factors such as granulocyte- 
macrophage colony-stimulating factor (GM-CSF) help shape the 
tumour microenvironment by promoting myelopoiesis and recruit- 
ment of suppressive myeloid cells'*'’. To directly assess the ability 
of suppressive myeloid cells to induce resistance to ICB in the B16 
melanoma, we used B16-F10 cells transduced with a GM-CSF expres- 
sion construct (B16-GM-CSE)"“. Tumours in this B16 model (B16-GM- 
CSF) become infiltrated by suppressive TAMCs and lose sensitivity 
to ICB compared to B16-F10 controls (Fig. la~d and Extended Data 
Fig. 1), indicating the critical role of suppressive myeloid cells in ICB 
resistance. 

Kaneda et al.'° and other previous studies’®'” have shown that 
PI3K-7 is highly expressed in myeloid cells and promotes migration 
and production of inflammatory mediators. Mice lacking the p1107 
subunit of PI3Ky show reduced tumour growth and immunosuppres- 
sive TAMCs’®. We reasoned that pharmacological PI3Ky inhibition 
might lead to an effective anti-tumour immune response and sub- 
vert ICB resistance caused by suppressive myeloid cells. To test this 
hypothesis, we used IPI-549, a selective PI3K4 inhibitor in multiple 
tumour models!’ (Extended Data Fig. 2a). IPI-549 treatment alone 
led to tumour growth inhibition in 4T1, B16-GMCSF, MC38, CT26, 
and LLC tumour models (Fig. 2a and Extended Data Fig. 2b). Lack of 
activity in tumours with few suppressive TAMCs (B16-F 10, Fig. 2a) 
suggests PI3K~ inhibition affects myeloid cells and has no direct effect 
on tumour cells or other TILs. We also observe a significant reduction 
in lung metastasis after IPI-549-treatment (Fig. 2b). TIL quantifica- 
tion shows no consistent differences in the total myeloid (CD11b*) or 
macrophage (CD11b*F4/80*) cell populations in either IPI-549- or 
vehicle-treated 4T1 or B16-GMCSF tumours (Fig. 2c). The effects of 
IPI-549 on TAMC subsets were comparable to those seen in Pik3cg '~ 
mice’*"'®. PI3K+ inhibition switches the activation of macrophages from 
an immunosuppressive M2-like (CD11btF4/80'CD206*) phenotype 
to a more inflammatory M1-like (CD11b*F4/80*MHCII*) state 
(Fig. 2c and Extended Data Fig. 2c). We further tested RNA expression 
of M1 and M2 markers in 4T1 and B16-GMCSF tumours after IPI- 
549. The expression of prototypic M2 markers (TGF, Arg-1, IDO) 
are reduced, while M1 markers (IL-12, INOS) are higher in IPI-549- 
treated tumours (Fig. 2d). Given that CD11b*F4/80* macrophages 
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constitute only a portion of suppressive TAMCs, we further subdivided 
myeloid cells into granulocytes (CD11b*Ly6G*), monocytes (Ly6Chi8"/ 
MHCIL, Mono-Lo), immature macrophages (LyCo"8/MHCIIP8h, 
Mono-Hi), M1 macrophages (Ly6C!’’/MHCII"8*, TAM-M1) or M2 
macrophages (Ly6C!°’/MHCII'’, TAM-M2)?>!. When we analysed 
the myeloid cells with the above gating strategy in 4T1 TILs, we also 
observed that IPI-549 shifts myeloid cells towards the TAM-M1 popu- 
lation (Extended Data Fig. 3a). Moreover, relative mRNA expression of 
M1 and M2 markers correlate with a less suppressive function of these 
cells (Extended Data Fig. 3b). We subsequently tested the suppressive 
function of myeloid cells derived from IPI-549-treated B16-GMCSF 
tumour-bearing mice on naive CD8* T cell proliferation (Fig. 2e). We 
show that suppression of CD8* T cells is abolished in IPI-549-treated 
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Figure 1 | Resistance to checkpoint blockade is associated with 
suppressive myeloid cells infiltration in tumour microenvironment. 

a, Top, mean tumour volume of subcutaneous (4T1) or intradermal 

(B16, B16-GMCSF) implants in anti-PD-1-, anti-CTLA4- or control- 
treated mice (n= 10). Bottom, survival of 4T1, B16 or B16-GMCSF 
tumour bearing mice treated with anti-PD1 or anti-CTLA4 compared to 
control (vehicle treated only) (n= 10). b, Representative flow cytometric 
analysis and quantification of CD11b*, CD8*, CD4* and regulatory 

T cells (Treg cells; CD4*Foxp3*) cell populations in 4T1, B16, B16-GMCSF 
tumours at 14 days post implantation (n = 5). c, Representative flow 
cytometric analysis and quantification of granzyme B, PD-1 expression on 
CD8* T cell populations in 4T1, B16, B16-GMCSF tumours at 7 days post 
implantation (n =5). d, In vitro suppressive activity of tumour-infiltrating 
CD11b* cells purified from 4T1, B16 and B16-GMCSF tumour- 

bearing mice. Representative histograms of CD8* T cell proliferation at 
corresponding CD8* to CD11b* T cell ratio (left panel) and quantification 
of CD8* T cell proliferation using carboxyfluorescein succinimidy] ester 
(CFSE) dilution (right panel) (n = 3). Data represent analysis of n (shown 
above for each experiment) mice per group, mean + s.e.m. Experiment 
were repeated at least twice to observe concordant statistical significance 
defined by P value: *P < 0.05, **P < 0.01, ***P < 0.001, ****P < 0.0001 
(non-parametric Mann-Whitney U-test, log-rank (Mantel—-Cox) test 
survival comparison). 


mice or when IPI-549 is added to the media. A similar functional 
observation was made in human myeloid suppressor cells in PBMCs 
(Extended Data Fig. 4b) confirming its potential clinical use. In addi- 
tion, pharmacodynamic evaluation of whole blood confirms the inhi- 
bition of PI3Ky in monocytes after activation with a PI3Ky stimulus 
in human volunteers (Extended Data Fig. 4a.). To confirm that inhibi- 
tion of PI3Ky in myeloid cells is required to delay tumour progression, 
we depleted myeloid cells (anti-CD11b) before implanting LLC-Brei 
tumours in mice. Treatment with IPI-549 did not delay tumour growth 
in the absence of TAMCs (Extended Data Fig. 2e). Taken together these 
findings suggest that PI3K inhibition using IPI-549 is mainly effective 
in a tumour landscape rich in suppressive myeloid cells and allows for 
more precise delineation of patients in which it will potentially yield 
the greatest activity. 

The IPI-549-driven polarization of myeloid cells to a less immuno- 
suppressive phenotype correlates with increased CD8* T cell infiltrates 
and a higher CD8*/regulatory T cell ratio at day 14 in both 4T1 and 
B16-GMCSF tumours (Fig. 3a). CD8* T cells expressed more granzyme 
B and proliferated more (Ki67*) in IPI-549 treated groups (Fig. 3a and 
Extended Data Fig. 5a). Along with these effector mechanisms, the 
expression of PD-1 and CTLA4 were enhanced in T cells (Fig. 3a and 
Extended Data Fig. 5a). Furthermore, in mice lacking T cells (RAG 
KO, Nu/Nu or after anti-CD8* T cell depletion) the impact of IPI-549 
was abrogated (Fig. 3b and Extended Data Fig. 5b, c). These findings 
confirm that targeting PI3Ky enhances the T-cell-mediated anti- 
tumour activity by modifying the suppressive function of myeloid cells. 
We were concerned that targeting a key pathway in myeloid cells may 
affect their antigen presentation. Reassuringly, we found that IPI-549 
did not reduce, but rather enhanced, the activation of tumour-antigen- 
specific T cells. This was demonstrated by the transfer of tumour- 
antigen-specific cells (Pmel-1) in B16-GMCSF tumour-bearing mice 
(Fig. 3c) and by ELISpot using cells isolated from CT26 tumour-bearing 
mice (Extended Data Fig. 5d). Additionally, the proportion of effec- 
tor memory cells (CD44+CD62L_) in CD8* and CD4* populations 
was not affected (Extended Data Fig. 6), suggesting that IPI-549 does 
not affect the ability of antigen-presenting cells to activate antigen- 
specific T cells. Taken together, these results demonstrate that selective 
PI3K7 inhibition reduces the immune-suppressive function of myeloid 
cells, ultimately promoting T effector activation and T-cell-mediated 
cytotoxicity. 

We found that treatment of myeloid-enriched tumours with IPI-549 
leads to upregulation of PD-1 and CTLA4 expression on CD8* T cells 
(Fig. 3a and Extended Data Figs 2d and 5a). This strongly suggests that 
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Figure 2 | Selective targeting of PI3K~ reduces 
tumour growth and metastasis in various 
checkpoint blockade-resistant tumour models 
associated with high levels of myeloid cell 
infiltrates. a, Therapy regimen (top panel, PO, 

per os, QD, once a day) and mean tumour volume 

of subcutaneous (4T1) or intradermal (B16- 

GMCSF) implants in mice treated with vehicle or 
IPI-549 (15 mg kg“! administered orally, daily) 

(n= 10; bottom panel). b, Representative pictures 
and haematoxylin and eosin-stained sections of 

lung from B16-GMCSF tumour-bearing mice at 

day 7 on treatment with IPI-549 or vehicle (left 
panels). Quantification of lung metastasis at day 

14 and 21 from tumour challenge (right panel). 

c, Flow cytometric analysis and quantification of 
CD11b*F4/80* (TAM) cell populations in 4T1 
tumours at day 7 post implantation (n = 5), expression 
of CD206 (M2) and MHCII (M1) in CD11b*F4/80+ 
cell populations. The histogram bars show the 
percentage change in each cell population (CD11b*, 
TAM, M2 and M1) in IPI-549-treated group in 
comparison to the vehicle-treated controls. d, mRNA 
expression of selected M1 and M2 markers in 4T1 
and B16-GMCSF tumours as determined by RT-PCR 
(n= 3), data were relative to GAPDH expression 

and normalized versus the mean of vehicle-treated 
tumours. e, In vitro suppressive activity of tumour- 
infiltrating CD11b* cells purified from 4T1 
tumour-bearing mice at day 7 post treatment with 
IPI-549 or vehicle. Representative histograms of 
CD8* T cell proliferation at a ratio of 1:1 CD8* to 
CD11b* T cells (left panel) and per cent CD8* T cell 
proliferation (right panel) (n= 3 biological replicates). 
Data represent analyses of n (shown above for each 
experiment) mice per group, mean +s.e.m. *P < 0.05, 
**P < 0.01, ***P < 0.001 (non-parametric Mann- 
Whitney U-test). 


Figure 3 | Reduction of myeloid suppressive 
phenotype correlates with higher anti-tumour 

T cell activity. a, Left, quantification by flow 
cytometry of ratio of M1/M2 in CD11b*F4/80* cell 
population, CD8* T cells in CD45* TILs and ratio of 
CD8*/Tyeg (CD4*FoxP3*) in CD45* TILs in 4T1 and 
B16-GMCSF tumours at day 7 and 14 on treatment 
(n=5). Right, quantification of granzyme B and 

PD1 expression in CD8* TILs in 4T1 and B16-GMCSF 
tumours at day 7 and 14 on treatment. b, Mean 
tumour volume of intradermal B16-GMCSF tumours 
in IPI-549- versus vehicle-treated wild-type C57Bl6 
(left) or Rag~/~ mice (right) (n = 10). c, Flow 
cytometric analysis and quantification of gp-100 
antigen-specific T cells transferred into IPI-549- or 
vehicle-treated B16-GMCSF tumour-bearing mice 
(n=5). Data represent analysis of n (shown above 

for each experiment) mice per group, mean + s.e.m. 
*P< 0.05, **P< 0.01 (non-parametric Mann- 
Whitney U-test). 
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Figure 4 | Resistance to checkpoint blockade therapy is overcome 
when combined with selective PI3K~ inhibition. a, Therapy regimen. 
b, Mean tumour volume of subcutaneous 4T1 tumour in control, anti- 
CTLA4- or anti-PD-1-treated mice in combination with IPI-549 or vehicle 
(n= 10). ¢, Individual tumour volumes of subcutaneous 4T1 implants in 
mice treated with anti-CTLA4 and anti-PD1 in combination with IPI- 
549 or vehicle (n = 10), dotted red line corresponds to end of treatment 
post tumour implantation (day 21). CR, complete response defined as 
no visible or palpable tumour on consecutive measurements. d, Survival 
at 150 days of 4T1 tumour-bearing mice treated with anti-CTLA4 and 
anti-PD1 in combination with IPI-549 or vehicle compared to control 
(vehicle-treated only) (n= 10). e, Mean tumour volume of intradermal 
B16-GMCSF implants in anti-PD-1-, anti-CTLA4- or control-treated 
mice in combination with IPI-549 or vehicle (n= 10). f, Individual 
tumour volumes of intradermal B16-GMCSF implants in mice treated 
with anti-CTLA4 and anti-PD1 in combination with IPI-549 or vehicle 
(n= 10). g, Survival at 150 days of B16-GMSCF tumour-bearing mice 
treated with anti-CTLA4 and anti-PD1 in combination with IPI-549 

or vehicle compared to control (vehicle-treated only) (n = 10). Data 
represent analyses of n (shown above for each experiment) mice per 
group, mean + s.e.m. *P < 0.05, **P < 0.01, ***P < 0.001, ****P < 0.0001 
(non-parametric Mann-Whitney U-test or log-rank test). 


combining PI3K~ inhibition with ICB therapy could yield additional 
anti-tumour activity. In fact, in the ICB-resistant 4T1 or B16-GMCSF 
models, the combination of either anti-CTLA4 or anti-PD-1 with IPI- 
549 significantly delayed tumour growth when compared to ICB alone 
(Fig. 4a-g). We further confirmed this observation in the ICB-resistant 
LLC tumour model (Extended Data Fig. 7). Analysis of TILs in the 
combination regimen of IPI-549 with PD-1 blockade shows that the 
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addition of IPI-549 reduces immune suppression by increasing the M1/ 
M2 ratio, resulting in improved T cell effector function in 4T1 and B16- 
GMCSF tumours (Extended Data Fig. 8). These results are similar to 
those reported by Kaneda et al. in which PD-1 blockade was combined 
with a dual PI3K inhibitor (+/6)!°; however, given the important effect 
of PI3K6 in T cell function and more specifically regulatory T cells, it is 
unclear whether the observed anti-tumour drug effect in that study 
was myeloid- and/or T-cell-dependent. As CTLA4 and PD-1 have 
distinct immunologic mechanisms of action, greater therapeutic 
efficacy is achieved when both therapies are combined”*. We tested 
the anti-PD-1 and anti-CTLA4 combination with IPI-549 in the 4T1 
and B16-GMCSF models (Fig. 4a). Double checkpoint blockade ther- 
apy alone did not result in any complete tumour regressions in 4T1 
tumour-bearing mice and only 20% of the mice bearing B16-GMCSF 
tumours benefited from the therapy, further confirming multiple 
checkpoint resistance in these models. Notably, the addition of IPI- 
549 to the combination of anti-CTLA4 and anti-PD-1 was associated 
with complete remissions in 30% of 4T1- and 80% of B16-GMCSF 
tumour-bearing mice. Importantly, tumour-free survivors were resist- 
ant to tumour re-implantation (Extended Data Fig. 9), indicating 
long-lasting adaptive immunity. 

Taken together, our data indicate that high infiltration by sup- 
pressive TAMCs promotes resistance to ICB, further confirming 
the complex, multi-factorial mechanisms of immune suppression in 
the tumour microenvironment. We show that modulating the suppressive 
phenotype of these cells towards a more inflammatory one can be 
achieved by selective pharmacologic targeting of the myeloid PI3Ky 
isoform. By doing so, sensitivity to ICB can be effectively restored 
in TAMC-rich tumours. These findings provide a strong rationale to 
consider exploring PI3K~ inhibition to overcome resistance to ICB 
in clinical trials based on a precision-medicine-type assessment of 
the tumour immune landscape. This hypothesis is currently being 
tested in the IPI-549 Phase I/Ib trial, as both a monotherapy and in 
combination with PD-1 blockade in solid tumours (clinicaltrials.gov 
identifier NCT02637531). 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Cell lines. The murine cancer cell lines for melanoma (B16F10, referred to as B16), 
breast cancer (4T1) and colon cancer (CT26) were obtained from ATCC. The colon 
cancer cell line MC38 was obtained from the NCI and the lung cancer (LLC-Brei) 
from Caliper Life Sciences. Cells were maintained in RPMI medium supplemented 
with 10% fetal calf serum (FCS) and penicillin with streptomycin (complete RPMI 
media). The GMCSF-secreting B16 cell line (referred as to B16-GMCSF) has been 
reported and was used to increase the number of myeloid cells recruited to the 
tumour", The cell lines have been mycoplasma tested. 

Quantitative RT-PCR. cDNA was prepared using 11g RNA with the qScript 
cDNA Synthesis Kit (Quanta Biosciences). Sybr green-based qPCR was performed 
using murine primers to Arg1, Ifng, 1110, I112p40, Il1b, 116, Ccl2, Gapdh, 
Nos2, Tgfb1, Tnfa, IL4ra, Indo, Ctla4, Pd-l1, CD86, CxCr2, Fizz1, Ymd1 (Qiagen 
QuantiTect Primer Assay). mRNA levels were normalized to Gapdh (AC, = C; 
gene of interest - C, Gapdh) and reported as relative mRNA expression 
(AAC, = 27 (ACrsample— ACicontrol)) or fold change. 

Cytokine analysis. Tumours were excised, snap-frozen in liquid nitrogen, and 
pulverized using a tissue grinder. Tumour protein lysates were prepared in MSD 
Tris Lysis Buffer (Meso Scale Discovery, Rockville, Maryland) containing 2 x Halt 
Protease and Phosphatase Inhibitor Cocktail (Fisher Scientific). Total protein con- 
centration was normalized to 4mg ml! and cytokines were quantified using the 
MILLIPLEX map Mouse Cytokine/Chemokine Magnetic Bead 32 Plex Panel and 
MILLIPLEX map Mouse CD8* T Cell Magnetic Bead Panel kits according to the 
manufacturer's instructions (Millipore, Billerica, Massachusetts). 

RNA-seq. Mice bearing CT26 tumours were treated with vehicle or IPI-549 
(15mg kg? day~!, PO) for 6 or 9 days. Tumours were isolated, and frozen until 
needed; tumours then thawed and RNA was extracted from all cells. RNA-seq was 
done at Expression Analysis (Q2 Solutions). Sequence reads were aligned to the 
mouse B38 reference genome using OmicSoft ArrayStudio and the UCSC gene 
model. log3[FPKM] was calculated for each gene, and data were mean centred for 
display in heat maps. The analysis focused on a compilation of about 4,200 mouse 
genes related to cancer immunology and PI3K pathway signalling compiled from 
numerous sources including BioCarta pathways, GO gene ontologies, KEGG path- 
ways, WikiPathways, and literature?>. 

Mice. Female C57BL/6J and Balb/c mice (6-8 weeks old) were purchased from 
Jackson Laboratory. Pmel-1 TCR transgenic mice have been previously reported” 
and were provided by N. Restifo (National Cancer Institute, Bethesda, Maryland). 
All mice were maintained in micro-isolator cages and treated in accordance with 
the NIH and American Association of Laboratory Animal Care regulations. All 
mouse procedures and experiments for this study were approved by the Memorial 
Sloan Kettering Cancer Center Institutional Animal Care and Use Committee. No 
statistical methods were used to predetermine sample size. Ten to fifteen mice per 
treatment strategy were used to allow 90% power, and a 5% significance level, and 
detect differences in tumour-free survival from 10% to 80%. Typically, tumours 
grew in 100% of control animals. An additional 5 mice per group were used for 
tissue harvest at day 7 and day 14 on treatment. Mice cage and treatment allowance 
were randomized at day 7 after tumour implants. 

Tumour challenge and treatment experiments. On day 0 of the experiments, 
tumour cells were injected intradermally (i.d.) in the right flank. For the B16 model, 
2.5 x 10° B16-WT or B16-GMCSEF cells were injected into C57BL/6J mice. For 4T1 
model and for the CT26 model, 5 x 10° cells were used subcutaneously in Balb/c 
mice. For studies in immune compromised mice, the CT26 study was done in the 
Balb/c NU/NU strain and the B16-GMCSF in C57B1.6 Rag1~/~ mice. Mice were 
obtained from Jackson Labs and Charles River Labs. Treatments were given as 
single agents or in combinations with the following regimen for each drug. The 
PI3K~y inhibitor drug IPI-549 was dissolved at 5% 1-methyl-2-pyrrolidinone in 
polyethylene glycol 400 and administered by oral gavage once a day at 15mg kg"!. 
Treatment was initiated on day 7 ending on day 21 post tumour implant. Control 
groups received vehicle (5% NMP, 95% PEG) without the active product. Anti- 
CTLA4 antibody (100 1g per mouse, clone 9H10, Bio X cell) and anti-PD-1 anti- 
body (250,1g per mouse, clone RPM1-14, Bio X cell) were injected intraperitoneally 
(i.p.) on days 7, 10, 13 and 16 for the B16, B16-GMCSF and 4T1 models and on 
days 10, 13 and 16 for the CT26 model. Tumours were measured every second or 
third day with a calliper, and the volume (length x width x height) was calculated. 
Mice that had no visible and palpable tumours that could be measured on consecu- 
tive measurement days were considered complete regressions. Animals were eutha- 
nized for signs of distress or when the total tumour volume reached 2,500 mm*. For 
the re-challenge study: mice with complete responses in the anti-PD-1 treatment 
group and the anti-PD-1 and IPI-549 combination group were re-challenged with 
2.5 x 10° CT26 WT tumour cells (on day 106 of the study since original tumour 
implant). Additional mice with complete responses from an additional IPI-549 and 
anti-PD-1 group were implanted with 1 x 10° 4T1 tumour cells. 


Cell depletion studies. Female BALB/c mice were depleted of CD8* T cells by 
intraperitoneal (IP) injection of a CD8-depleting antibody beginning 3 days 
before tumour implantation and continuing every 3 days (Q3D) for the duration 
of the study. Control animals were injected IP with an isotype control antibody 
according to the same dosing schedule. CD8* T cell depletion was verified by flow 
cytometry of splenic cells from a separate cohort of mice at the time of tumour 
implant. Anti-CD8 antibody (BioXCell, in vivo MAb Rat IgG2b anti-mouse CD8, 
clone 169.4) and isotype control antibody (BioXCell, in vivo MAb Rat IgG2b, 
k isotype control, clone LTF-2) were used. CD8* T-cell-depleted and isotype 
contro-treated mice were implanted with 2.5 x 10° CT26 WT cells subcutaneously 
to the dorsal right flank. Eight days after implant, tumour-bearing animals with 
average tumour volumes of 50 to 60 mm? began treatment. Animals were dosed 
daily with vehicle or IPI-549 (15mg kg~!, PO) and dosed every three days with 
100 1g antibody, either isotope control or anti-CD8. Tumour measurements were 
taken every second or third day during the 13 days of dose administration. For 
CD11b* cell depletion, murine Lewis Lung Carcinoma tumour brei (here referred 
to as LLC) was propagated into ten C57BL/6 Albino male mice. When tumours 
reached an average of 1,500 mm’, tissue was collected and made into a single-cell 
suspension. Tissue was dissociated in a glass dunce, filtered through a 100m filter 
and washed with cold buffer of PBS pH 7.2, 0.5% BSA and 2mM EDTA. An aliquot 
of cells was separated, and placed on ice until the time of re-transplant. Cells were 
counted and adjusted to 2 x 10’ total cells. To deplete the CD11b cells from the 
tumour homogenate, CD11b micro beads (MACS #130-049-601) were added to 
the sample and incubated on ice for 15 min. Cells were washed and re-suspended 
with buffer. A negative depletion was preformed following the Miltenyi autoMACS 
protocol. The positive selected cells and starting tumour inoculum were counted 
and cell numbers adjusted to the tumour inoculum used for historical studies. For 
each cell condition 30 C57BL/Albino male mice were inoculated. Treatment of 
IPI-549 or vehicle started when tumours for each cell condition reached an average 
of 200 mm’ regardless of what day post implant this occurred. 

Isolation of tumour-infiltrating cells and lymphoid tissue cells. Mouse tumour 
samples were minced with scissors before incubation with 1.67 U ml! Liberase 
(Roche) and 0.2 mg ml“! DNase (Roche) in RPMI for 30 min at 37°C. Tumour 
samples were homogenized by repeated pipetting and filtered through a 100-1m 
nylon filter (BD Biosciences) in RPMI supplemented with 7.5% FCS to generate 
single-cell suspensions. Cell suspensions were washed once with complete RPMI 
and purified on a Ficoll gradient to eliminate dead cells. Cells from mouse spleens 
were isolated by grinding spleens through 40-|um filters. After red blood cell (RBC) 
lysis (ACK Lysing Buffer, Lonza) when required, all samples were washed and 
re-suspended in FACS buffer (PBS/0.5% albumin) or RPMI depending on further 
use. 

Flow cytometry and morphology analysis. Cells isolated from mouse tumours 
and spleens were pre-incubated (15 min, 4°C) with anti-CD16/32 monoclonal anti- 
body (Fc block, clone 2.4G, BD Biosciences) to block nonspecific binding and then 
stained (30 min, 4°C) with appropriate dilutions of various combinations of the 
following fluorochrome-conjugated antibodies: anti-CD45-AF 700 (clone 30-F11), 
anti-CD11b-APC-Cy7 (clone M1/70), anti-CD11b-PE-TR(M1/70.15), anti-Ly6G- 
APC (clone 1A8), anti-F4/80-PercP-Cy5.5 (clone BM8), anti-Ly6C-PE-Cy7 (clone 
AL-21), anti-MHC Class II-eFluor 450 (clone M5/114.15.2), anti-CD206-PE (clone 
19.2), anti-CD8-Percp-Cy5.5 (clone 53-6.7), anti-CD8-PE Texas Red (clone 5H10), 
anti-CD4-PE-Cy7 (clone RM4-5), anti-CD4-Pacific Blue (clone RM4-5), anti- 
Foxp3-APC (clone FJK-16s), anti- Foxp3-PE-Cy7 (clone FJK-16s), anti-CD25- 
APC-Cy7 (clone PC61), anti-CD44-PE-Cy7 (clone IM7), anti-CD62L-PE (clone 
MEL-14), anti-Ki67 (clone B56), anti-CTLA4-APC (clone), anti-Granzyme 
B-PE-TR (clone GB11), antibodies, all purchased from BD Biosciences, eBiosci- 
ence or invitrogen. For intracellular stain, cells were further permeabilized using 
a FoxP3 Fixation and Permeabilization Kit (eBioscience) and stained for Foxp3 
(clone FJK-16s, Alexa-Fluor-700-conjugated, eBioscience), Ki67 (clone SolA15, 
eFluor-450-conjugated, eBioscience) or CTLA4. The stained cells were acquired 
on a LSRII Flow Cytometer using BD FACSDiva software (BD Biosciences) and 
the data were processed using FlowJo software (Treestar). Dead cells and doublets 
were excluded on the basis of forward and side scatter and Fixable Viability Dye 
eFluor 506. 

Purification of myeloid or CD8* T cells from tumours or spleen. Mouse tumour 
and spleen single-cell suspensions were generated as described in the previous sec- 
tion. Tumour cells were subsequently separated from debris over a Ficoll gradient 
(Sigma-Aldrich). B cells were depleted from splenocytes using CD19 microbeads 
and LD columns according to the manufacturer’s instructions (Miltenyi Biotec) 
to enrich the myeloid fractions. Cells were stained with anti-CD45.2-Alexa- 
Fluor-700, anti-CD11b-APC-Cy7, anti CD8-PE antibodies for flow sorting on a 
FACSAria II Cell Sorter (BD Biosciences). Dead cells were excluded using DAPI 
(Invitrogen). Purity of flow-sorted populations was above 90%. 
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Isolation of Pmel lymphocytes and adoptive transfer. Spleens and lymph nodes 
from pmel-1 TCR transgenic mice were isolated and ground through 100-jm 
filters. After RBC lysis, CD8* T cells were purified by positive selection using 
Miltenyi magnetic beads. The isolated cells were loaded with CellTrace Far Red 
DDAO-s.e. (Thermo Fisher Scientific) and injected into recipient animals via tail 
vein at indicated numbers. Activated CD8* T cells were generated by culturing 
splenocytes with soluble anti-CD3 (11g ml~!, clone 145-2C11, eBioscience) and 
anti-CD28 (21g ml, clone 37.51, eBioscience) for 72h. Recombinant mouse 
IL-2 (30U ml}, Chiron) was added for the final 24h of culture. CD8* T cells were 
subsequently positively selected with Miltenyi magnetic beads before injection via 
tail vein, as described above. The frequency and proliferation of Pmel cells were 
measured 2 weeks after tumour challenge and 7 days after adoptive transfer of 
1 x 10° in vitro activated CD8* Pmel T cells using Thy1.1 antibody and by assessing 
CellTrace Far Red DDAO-s.e. dilution by flow cytometry, respectively. 

T cell suppression assay. Spleens from naive mice were isolated and ground 
through 40-\.m filters to generate a single-cell suspension. After RBC lysis, CD8* 
cells were purified using anti-CD8 (Ly-2) microbeads (Miltenyi Biotech) accord- 
ing to the manufacturer's protocol and labelled with 1 mM CFSE (Invitrogen) in 
pre-warmed PBS for 10 min at 37°C. The CFSE-labelled CD8* T cells were then 
plated in complete RPMI media supplemented with 0.05 M B-mercaptoethanol 
onto round bottom 96-well plates (25 x 10° cells per well) coated with 1 jig ml! 
anti- CD3 (clone 1454-2C11) and 51g ml7! anti-CD28 (clone 37N) antibodies. 
Purified myeloid cells were added in indicated ratios and plates were incubated at 
37°C. After 48h, cells were harvested and CFSE signal in the gated CD8* T cells 
was measured by flow cytometry (LSRII Flow Cytometer, BD Biosciences). For 
the human MDSC suppression assay PBMCs were isolated using Lymphoprep 
from donor blood. T cells were isolated by CD3* selection (Easysep, Stem Cell 
Technologies) and frozen in Sigma freezing media for later use. The remaining 
PBMCs without T cells were incubated with 20ng ml”! GMCSF and 20ng ml! 
IL-6 for 6 days to differentiate the myeloid cells and were incubated with or without 
added IPI-549. After 6 days, the MDSC cells were isolated by CD33* selection 
(Easysep, Stem Cell Technologies). These MDSC cells were then mixed with the 
autologous T cells (at a 1:4 ratio) that had been pre-stained with cell trace violet 
and activated with anti-CD3 and anti-CD28 beads (Dynal). The T cell prolifera- 
tion is determined by Cell Trace Violet dye dilution measured by flow cytometry 
after 72h. 

IFN ELISPOT assay. Blood was collected from CT26 tumour-bearing mice 
after 10 days of treatment with either vehicle or IPI-549 (15mg kg~'). PBMCs 
were isolated using Lymphoprep density gradient media (Stem Cell Technologies, 
Vancouver, BC). IFN producing cells were quantified using the CTL Mouse 
Immunospot IFNy Single Colour ELISPOT kit (CTL, Shaker Heights, Ohio) 
according to the manufacturer’s instructions. For in vitro re-stimulation, 1 x 10° 
PBMC were co-cultured with 1 x 10° irradiated (2,000 rad) CT26 colon carcinoma 
target cells in CTL test media (CTL) for 16h. Irradiated (2,000 rad) 4T1 mammary 
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carcinoma target cells were used as negative control targets to assess specificity. 
IFN‘ spots were quantified using a CTL Immunospot S6 Micro Analyzer and 
Immunospot Professional Software (CTL). 
Macrophage polarization assay. Bone marrow derived macrophages were 
prepared from C57BI/6 mice femur and tibias. Red blood cells were lysed and 
then the remaining cells were plated in bone macrophage media (BMM) con- 
sisting of DMEM, 20% FBS plus penicillin/streptomycin and 50ng ml-' M-CSF 
and incubated for 6 days. Cells were polarized towards an M2 phenotype with the 
addition of 20ng ml“! IL4 and 50ng ml“! M-MCSF (both from R&D Systems) 
with or without added IPI-549. Cells were incubated for 48h and then RNA was 
harvested from the cells (Qiagen RNeasy). qRT-PCR was performed using primers 
for mouse ARG] (Mm00475988_m1, Life Technologies NY) and mouse {-actin 
(Mm00607939_s1 Life Technologies, New York). 
Biochemical and cell-based assays. Biochemical and cell-based assays for the 
Class I and Class II PI3K isoforms were run as previously described’*”°. 
PI3K~-specific whole-blood PD assay. Whole blood from six healthy donors 
was pre-treated with a dose titration of IPI-549 and then stimulated for 2 min with 
2.3 ug ml"! CXCL12. Cells were lysed, fixed and stained. Human blood samples 
were collected after ICF approval. Response to stimulation was determined by 
measuring phosphorylation of AKT $473 in monocytes (CD14*) by flow cytom- 
etry and comparing the value to that of untreated controls. ICs values for IPI-549 
were calculated for each donor, measuring compound potency against PI3K~ and 
in whole blood. 
Statistics. Where indicated, data were analysed for statistical significance and 
reported as P values. Data were analysed with a Mann-Whitney test when com- 
paring means of two independent groups and two-way ANOVA when comparing 
more than two groups. P < 0.05 was considered statistically significant (*P < 0.05, 
**P< 0.01, ***P < 0.001, ****P < 0.0001). Evaluation of survival patterns in 
tumour-bearing mice was performed using the Kaplan-Meier method, and results 
were ranked according to the Mantel-Cox log-rank test. P< 0.05was considered 
statistically significant. Survival was defined as mice with tumours <2,500 cm}. 
The experiments were not randomized. The investigators were not blinded to 
allocation during experiments and outcome assessment. Experiments have been 
repeated to ensure reproducibility of the observations. 
Data availability. Source Data for the main figures are provided in the online 
version of this paper. 
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Extended Data Figure 1 | Effect of suppressive myeloid TILs in response 
to checkpoint blockade. a, Individual tumour growth of subcutaneous 
(4T1) or intradermal (B16, B16-GMCSF) implants in anti-PD-1-, 
anti-CTLA4- or control-treated mice (n= 10). b, In vitro suppressive 
activity of tumour-infiltrating CD11b* cells purified from spleen of 4T1, 


B16, B16-GMCSF tumour-bearing mice. Representative histograms of 
CD8* T cell proliferation at corresponding CD11b* to CD8* T cell ratio 
(left panel) and quantification of CD8* T cell proliferation (right panel) 
(n=3), mean+s.e.m. *P < 0.05, **P< 0.01, ***P< 0.001, ****P<0.0001 
(non-parametric Mann-Whitney U-test). 
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Extended Data Figure 2 | Effect of selective PI3K~y inhibition on infiltrating leukocytes from IPI-549- versus vehicle-treated CT26 tumour- 
tumour growth and myeloid TILs. a, Binding affinities (Ky) and cellular bearing mice. d, RNA-seq of co-stimulatory and checkpoint molecules 
ICs inhibition of pAKT by IPI-549 for class I PI3K isoforms (left table). on whole tumours from CT26 tumour-bearing mice treated for 6 or 
Percentage of inhibition of expression on bone-marrow-derived 9 days with IPI-549 compared to vehicle. e, Mean tumour volume of 
macrophages polarized with M-CSF and IL-4, (right panel). b, Percentage subcutaneous LLC-Brei implants in IPI-549- versus vehicle-treated mice 
of tumour growth inhibition in LLC, MC38, 4T1, CT26, B16-GMCSF without or after CD11b* cell depletion. Data represent analysis of 5-10 
tumour-bearing mice treated with IPI-549 (table). c, Quantification mice per group, mean +s.e.m. *P < 0.05, ***P < 0.001 (non-parametric 
of CD11b, CD206, NOS2 and PD-L1 expression in CD11b* tumour- Mann-Whitney U-test). 
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Extended Data Figure 3 | Effect of selective PI3K~ inhibition on subsets 
of CD11b myeloid cells. a, Representative flow cytometry analysis 

and quantification of Ly6C, MHC class II expression in CD11b*Ly6G~ 
cells infiltrating 4T1 tumours. b, mRNA expression of selected M1 

and M2 markers in sorted Ly6C’°YMHCIT’ (TAM-M2) compared to 
Ly6C!’’MHCII"8" (TAM-M1) population from 4T1 tumour, data were 
relative to GAPDH expression and normalized versus the mean of TAM- 
M1 population. Mean + s.e.m. *P < 0.05, **P< 0.01, ***P< 0.001, 

* P < ().0001 (non-parametric Mann-Whitney U-test). 
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analysis and quantification of CD62L and CD44 expression in CD8* and (non-parametric Mann-Whitney U-test). 
CD4* T cell infiltrates in tumour, lymph node (LN) and spleen of 4T1 
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Extended Data Figure 8 | Effect of combination of a selective PI3K 
inhibitor with checkpoint blockade on TILs. a, Mean tumour volume 
of subcutaneous 4T1 tumour-bearing mice treated with IPI-549, 

vehicle or anti-PD-1 in combination with IPI-549 or vehicle (n= 10). 

b, Representative flow cytometry analysis of CD206 and MHCII labelling 
in CD11bt F4/80* cell populations in the different treatment groups of 
4T1 tumour-bearing mice. c, Quantification of CD11b*F4/807, M1/M2 


% of CD8+ 


ratio, CD8*/Tyeg in TILs and granzyme B expression in CD8* T cells 
from 4T1 tumours in the different treatment groups. d, Quantification 
of CD11btF4/80*, M1/M2 ratio, CD8*/Treg in TILs and granzyme B 
expression in CD8* T cells from B16-GMCSF tumours in the different 
treatment groups, mean + s.e.m. *P < 0.05, **P < 0.01 (non-parametric 
Mann-Whitney U-test). 
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Extended Data Figure 9 | Effect of combination of a selective PI3K 
inhibitor with checkpoint blockade on acquisition of anti-tumour 
memory. a, Tumour re-challenge at 100 days (from first tumour implant) 
following primary tumour complete response in B16-GMCSF tumour- 
bearing mice treated with vehicle (blue) or IPI-549 (red) in combination 
with both anti-PD1 and anti-CTLA4. b, CT26 tumour-bearing mice with 


complete responses in the anti-PD-1 treatment group and the IPI-549 
plus anti-PD-1 combination treatment group were re-challenged with 
CT26 tumour implant. Additional mice with complete responses from the 
IPI-549 plus anti-PD-1 combination were implanted with 4T1 tumours. 
There was a low or no tumour take with CT26 re-challenge, while all 4T1 
tumours grew, indicating specific anti-tumour memory. 
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Alternative modes of client binding enable 
functional plasticity of Hsp70 


Alireza Mashaghi'*, Sergey Bezrukavnikov!*, David P. Minde!, Anne S. Wentink?*, Roman Kityk?, Beate Zachmann-Brand**, 
Matthias P Mayer?, Giinter Kramer?’, Bernd Bukau? & Sander J. Tans! 


The Hsp70 system is a central hub of chaperone activity in all 
domains of life. Hsp70 performs a plethora of tasks, including 
folding assistance, protection against aggregation, protein 
trafficking, and enzyme activity regulation’, and interacts with 
non-folded chains, as well as near-native, misfolded, and aggregated 
proteins® '°. Hsp70 is thought to achieve its many physiological roles 
by binding peptide segments that extend from these different protein 
conformers within a groove that can be covered by an ATP-driven 
helical lid!!~!5. However, it has been difficult to test directly how 
Hsp70 interacts with protein substrates in different stages of folding 
and how it affects their structure. Moreover, recent indications of 
diverse lid conformations in Hsp70-substrate complexes raise the 
possibility of additional interaction mechanisms!*'*, Addressing 
these issues is technically challenging, given the conformational 
dynamics of both chaperone and client, the transient nature of their 
interaction, and the involvement of co-chaperones and the ATP 
hydrolysis cycle’®. Here, using optical tweezers, we show that the 
bacterial Hsp70 homologue (DnaK) binds and stabilizes not only 
extended peptide segments, but also partially folded and near-native 
protein structures. The Hsp70 lid and groove act synergistically 
when stabilizing folded structures: stabilization is abolished when 
the lid is truncated and less efficient when the groove is mutated. 
The diversity of binding modes has important consequences: Hsp70 
can both stabilize and destabilize folded structures, in a nucleotide- 
regulated manner; like Hsp90 and GroEL, Hsp70 can affect the late 
stages of protein folding; and Hsp70 can suppress aggregation by 
protecting partially folded structures as well as unfolded protein 
chains. Overall, these findings in the DnaK system indicate an 
extension of the Hsp70 canonical model that potentially affects a 
wide range of physiological roles of the Hsp70 system. 

To study the DnaK system at the single-molecule level, we first used 
an engineered substrate that is prone to misfolding in vitro: four maltose 
binding proteins linked head-to-tail (4MBP, Fig. 1a). Mechanical 
stretching of 4MBP in the absence of chaperone”! first produced a 
gradual unfolding of the C-terminal segments of all four MBP proteins 
(Fig. 1b, N — 4), followed by the distinct unfolding events of the four 
remaining core structures (Fig. 1b, 4 3-2 1 — U). Relaxation to 
low forces and a waiting period of 5 s provided an opportunity to refold. 
However, while subsequent stretching showed a compact structure 
had formed, these structures typically failed to unfold (termed tight 
misfold, Fig. 1b) or unfolded in steps larger than one MBP core (termed 
weak misfold). Isolated 4MBP thus formed small aggregate structures 
consisting of more than one MBP repeat (Fig. 1c). Next, we performed 
these experiments in a buffer containing ATP and the DnaK chaperone 
system (DnaK, DnaJ, GrpE; Fig. 1d and Extended Data Figs 1-4). 
Unfolding and relaxation now rarely produced tight misfolds, though 
weak misfolds still formed (Fig. le, f). Moreover, the data showed more 
unfolding steps corresponding to a single MBP core (P < 0.05, Fig. le, 


green arrows), indicating the native-like refolding of core structures. 
These results show that the DnaK system can promote the correct 
refolding of a misfolding- and aggregation-prone protein, consistent 
with previous bulk studies of Hsp70”*:3. However, they do not reveal 
how the various components of the DnaK system affect client confor- 
mations during the different stages of folding. 

To understand the underlying mechanisms we focused on the 
simpler case of a single MBP to preclude aggregation and omitted 
the co-chaperones (Fig. 2a). Multiple protocols were followed. First, 
we unfolded MBP, which occurred in a two-step manner via the core 
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Figure 1 | DnaK chaperone system suppresses aggregation and 
promotes refolding. a, Schematic diagram of experiment. Tandem of MBP 
monomers (4MBP, blue) is attached to DNA linker (grey) and tethered 
between beads”’. b, 4MBP in the absence of chaperones. Thick grey line, 
first stretching curve showing native unfolding. Dark and light blue lines, 
subsequent stretching curves showing different misfolded states. Triangles, 
types of events or states (see panels c and f and main text). Thin grey lines, 
theoretical compliance for native (N) to unfolded state (U). Numbers of 
folded core structures are indicated. c, Corresponding event fractions 

(n= 38). Intermediate, unfolding events with unfolded length shorter than 
one MBP core. d, Schematic diagram of experiment. e, Stretching curves 
of 4MBP in presence of DnaK system and ATP. f, Corresponding event 
fractions (n=55). 
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relaxed in isolation and then incubated at low force with 11M DnaK and 
ADP loading buffer (n = 35). Dotted line, 100nM Dnak. f, g, Stretching 
of MBP that was incubated in the unfolded state with 1 1.M DnaK and 
ADP loading buffer and then relaxed (n = 15). Dotted line, 100nM Dnak. 
h, i, Stretching of MBP that was unfolded and relaxed with DnaK and 
ATP (n= 24). Pie charts indicate states observed after low-force period: 
compact (native or core-like), intermediate between core and unfolded, 
and unfolded. Grey and black lines, theoretical compliance for folded 
and unfolded MBP. j, Unfolding forces for core (C) and intermediate (i) 
states for panels b, d, f and h. Last column, DnaK trapped in ATP state 
(Extended Data Fig. 5j, k; n= 18). Whiskers, 90% and 10% extreme values; 
central band, median; rectangle symbol, mean. 
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state”?! (Fig. 2b), and then performed cycles of relaxation, waiting at 


0 pN for 5s, and subsequent stretching, all in the absence of chaperones. 
Stretching indicated the chain either had remained unfolded, or had 
refolded to a compact structure that typically unfolded in the same 
two-step manner (Fig. 2b). The refolding probability P, was about 0.85 
(Fig. 2c), which increased for longer waiting periods at 0 pN7°?!. 
Second, the cycle was paused at 0 pN, DnaK was added and allowed 
to interact. The Dnak lid is thought to be predominantly open in the 
presence of ATP and mostly closed after hydrolysis to ADP. We thus 
used 1mM ADP buffer with small amounts (~5%) of ATP (termed 
ADP loading buffer) in order to stall the ATP cycle after hydrolysis and 
promote DnaK-MBP binding. Stretching-relaxation cycles were then 
resumed for the same molecule. Surprisingly, the protein structures 
now typically failed to unfold over multiple such cycles until the tether 
broke. The structures were stable at high forces (maximally sustained 
force Fy > 40 pN) well above the native unfolding force (22 +5 pN), 
and sometimes even exceeded the effective maximum for our assay 
(65 pN)” (Fig. 2d, j). 

Notably, this scenario is unlike 4MBP aggregation (Fig. 1b), as aggre- 
gation partners are not available here. Instead, the data suggest that in 
ADP loading buffer, DnaK bound to folded structures and stabilized 
them against forced unfolding. When MBP molecules were stretched 
for the first time in the presence of DnaK and ADP loading buffer, they 
unfolded in similar manner as isolated MBP without DnaK (Fy=23 +4 
pN), which indicates that DnaK does not interact with fully native MBP, 
or only weakly. Rather, the data suggest that MBP is in a near-native 
state when stabilized by DnaK. Specifically, the MBP states stabilized 
after DnaK addition (Fig. 2d) had folded in the absence of chaperones, 
which yields native-like states (Fig. 2b). The stabilized structures 
were typically compact, much like the native or core MBP states, but 
could also display lengths in-between those of the core and unfolded 
states (Fig. 2d, e). Native MBP could also be stabilized after a minimal 
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perturbation, by unfolding the C-terminal segment in a first pull and 
then relaxing the force to preserve the native core state (Extended Data 
Fig. 5a, b). Overall, these data indicate that DnaK can stabilize partially 
folded and near-native MBP states against forced unfolding. 

Previous work had suggested, however, that DnaK rather stabilizes 
the unfolded state of substrates (Extended Data Figs 2-4)*°. With the 
aim of promoting this binding mode, we modified the protocol. After a 
few unfolding-refolding cycles, unfolded MBP was kept at a moderate 
force to prevent refolding, and incubated for 1 to 2 minutes with 
DnaK and ADP loading buffer. In subsequent relaxation-stretching 
cycles, MBP now remained unfolded (P, =~ 0.05; Fig. 2f, g), indicating 
that DnaK had bound to and stabilized the unfolded chain. Next, we 
reduced the DnaK incubation time from minutes to seconds by first 
unfolding MBP in the presence of DnaK and ADP loading buffer, 
and immediately subjecting it to relaxation and stretching (Extended 
Data Fig. 5c, d). Unfolded chains now did refold in subsequent cycles. 
Moreover, the refolded structures typically unfolded at increasingly 
high forces, until they were locked into a compact state that could not 
be unfolded (Extended Data Fig. 5d). Consistently, such a compact 
and locked state was achieved in earlier cycles for longer waiting 
times at 0 pN (Extended Data Fig. 5e, f). These data suggest a kinetic 
partitioning: unfolded chains can be bound by DnaK**, which blocks 
their refolding, unless the chains fold first (folding time is about 1s for 
MBP)” and are then stabilized by DnaK in a folded state. 

The observed stabilization by DnaK also occurred during active ATP 
hydrolysis. In the presence of DnaK and 1mM ATP, we found that 
refolded structures frequently unfolded via partially folded structures 
smaller than the core state (44% of the traces, lifetime of partial 
structure T= 1.1 +0.6s, Fig. 2h, i). For example, the light green curve 
in Fig. 2h indicates a partially folded structure beyond 5 pN that was 
stable for 7+ 4s, and then unfolded fully at 30 pN. This observation 
suggested the folded structure was stabilized by DnaK, and then 
unfolded when DnaK was released as it progressed through the ATP 
cycle. Consistently, the occurrence of transiently stable partial folds was 
higher than without chaperones (P< 0.01, 8% of the traces, 70.53), 
but lower than with DnaK and ADP loading buffer (P< 0.01, all traces, 
T <150s). The stabilization of extended and folded states by DnaK was 
also observed at lower DnaK concentrations (Fig. 2d, f and Extended 
Data Fig. 5g), and in presence of HPLC purified (ATP free) ADP 
(Extended Data Fig. 5h, i). Consistently, stabilization was not observed 
for a DnaK mutant (T199A) that is trapped in the open ATP state 
(Fig. 2j and Extended Data Fig. 5j, k). Overall, these results indicate 
that stabilization of partially folded structures by DnaK is modulated 
by its ATP hydrolysis cycle. 

We next attempted to identify which structural elements of the 
Hsp70 substrate-binding domain are conducive to stabilizing folded 
MBP conformations. The helical lid may have a role, as implied by 
the observation that ATP affects both binding to folded structures 
(Fig. 2d, h) and lid conformation”®, but this evidence is indirect. To 
address this issue more directly, we investigated DnaK with a truncated 
lid (DnaK (2-538), Extended Data Fig. 6)**. In the presence of this variant 
and ADP loading buffer, refolded MBP molecules did not show stabili- 
zation but rather native-like core unfolding, whether the waiting time 
at 0 pN was 5s (F, =22+£5 pN, Fig. 3a, e) or minutes (Fy =24+7 pN, 
Extended Data Fig. 51). By contrast, stabilization was observed after 
minute-long exposure at 0 pN to a DnaK variant that has a mutated 
binding groove but retains the lid (DnaK(V436F), Fig. 3c). Unfolding 
then occurred at forces up to 65 pN (Fig. 3c, e), and often (52% of 
the traces) via partial folds that are only scarcely observed without 
chaperone’. The unfolding forces were lower than for wild-type 
DnaK however (P <0.01, Fig. 3e). This groove mutation is reported to 
reduce peptide affinity by more than tenfold”. Consistent with such 
a peptide-binding defect, we found that after minute-long exposure 
to groove-mutated DnaK in the extended state, MBP chains formed 
folded structures when relaxed to 0 pN for 5s (n= 13), while refolding 
was efficiently blocked in the wild-type DnaK exposure experiments 
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Figure 3 | DnaK lid is central to stabilizing folded structures and 
suppressing aggregation. a, b, Stretching of MBP that was unfolded and 
relaxed with lid-truncated DnaK(2-538) in ADP loading buffer (n= 15). 
Waiting time at low force is 5s. c, d, Stretching of MBP that was unfolded 
and relaxed with groove-mutated DnaK(V436F) in ADP loading buffer 
(n=21). Waiting time at low force is of the order of minutes; pie charts 
indicate subsequently observed states: compact (native or core-like), 
intermediate between core and unfolded, and unfolded. e, Unfolding forces 
for core (C) and intermediate (i) states. Whiskers, 90% and 10% extreme 
values; central band, median; rectangle symbol, mean. f, Red squares 
qualitatively indicate observed stabilization of folded and extended states 
by DnaK variants. g, 4MBP refolding statistics. Indicated is type of event as 
fraction of all events. Error bars, +1 s.d. h, Mean unfolding force of non- 
native structures formed during 4MBP refolding. Error bars, +1 s.e.m. 


(Fig. 2f, g). Lid-truncated DnaK on the other hand, which retains the 
wild-type groove, was found to suppress refolding in regular stretch- 
relax cycles (P< 0.01, Fig. 3a, b). Thus, the stabilization of folded 
structures by DnaK is abolished by truncating the lid and weakened 
by mutating the groove. 

We surmised that binding partially folded structures could help 
to suppress aggregation®”> (Fig. le, f). If so, groove-mutated DnaK 
should retain some of this capability, as it binds folded structures 
(Fig. 3c). We found that the presence of groove-mutated DnaK and 
1mM ATP indeed decreased tight aggregation of 4MBP (P< 0.01, 
Fig. 3g and Extended Data Fig. 7). The aggregated structures that 
did form, unfolded at lower forces (P< 0.01, Fig. 3h). These findings 
are consistent with models where DnaK can destabilize misfolded 
structures*. The results may appear paradoxical: how can DnaK both 
result in more- (Fig. 2d) and less-stable (Fig. 3h) structures? The ATP- 
driven DnaK cycle offers a possible rationale. First, DnaK stabilizes 
in the ADP state, and hence only transiently. Moreover, by binding 
less-stable structures early after relaxation, DnaK can limit more-stable 
structures that would form later (Fig. 4a). Indeed, aggregates may have 
different structure and be less stable when formed with DnaK present. 
DnaK may also discriminate between native and non-native structures, 
and directly destabilize the latter*. We find DnaK selectively stabilizes 
partially and not fully folded MBP, though Dnak selectivity may go 
further. Overall, the data indicate that aggregation can be suppressed 
by the transient stabilization of (partially) folded structures. 

Second, other components of the DnaK system may also contribute 
to limiting aggregation. With the full DnaK system present, tight 
aggregation was less frequent than for DnaK and ATP, though the 
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Figure 4 | Model of Hsp70 chaperone action. a, Effect on substrate 
stability. Hsp70 can transiently stabilize partially folded structures that 
are less stable than complete folds upon Hsp70 release. Hsp70 binding can 
suppress the formation of permanently stable aggregates. ATP and co- 
chaperones provide stability control. For instance, high ADP and low ATP 
levels or low nucleotide exchange rates with heat-denatured GrpE*!, or Hip 
in mammalian cells*®, can prolong Hsp70-mediated stabilization. Hsp70 
could bind and (de)stabilize certain structural motifs. b, Hsp70 binding 
modes and roles of domains. This binding diversity and resulting ability 
to stabilize and destabilize folded structures could affect a wide range of 
Hsp70 physiological roles. 


difference was not large (P < 0.01, Fig. 3g). The co-chaperone GrpE did 
not display any substrate interactions by itself (Extended Data Fig. 8e). 
By contrast, DnaJ alone completely abolished aggregation (Extended 
Data Fig. 8a, b), but blocked refolding as well (Extended Data Fig. 8c, d). 
Within the full DnaK chaperone system, the holdase activity of DnaJ 
did not dominate, as efficient refolding was observed (see Fig. Le, f). 
These findings thus support previous observations that with ATP and 
GrpE, DnaK can promote DnaJ release from substrate”. 

Finally, the observations led us to speculate that DnaK could preserve 
protein structure and function under heat stress. As DnaK unfolds at 
lower temperatures than MBP, the latter is unsuitable. We instead inves- 
tigated a monomeric variant of the replication initiation protein RepE 
(RepE54, Extended Data Fig. 4d), a well-established DnaK substrate”’, 
using tryptophan fluorescence. We found that with (tryptophan-free) 
DnaK and ADP, the apparent thermal stability of RepE54 increased 
by about 5 °C (Extended Data Fig. 9). This upshift was not observed 
for DnaK and ATP or for lid-truncated DnaK with ADP, consistent 
with central roles for the ADP state and the lid in the stabilization of 
folded states. Groove-mutated DnaK with ADP also did not show an 
upshift, suggesting that the groove was more important here than in 
the single-molecule MBP experiments. We further observed that DnaK 
suppressed the temperature-induced decrease of luciferase activity in 
the presence of ADP (Extended Data Fig. 4c), whereas the ADP buffer 
alone did not (Extended Data Fig. 10). Together, these data suggest that 
Dnak can stabilize protein structure and activity during heat stress 
when ATP abundance is limited?. 

From these experiments on Dnak, a picture emerges of an Hsp70 
functional repertoire that is broader than previously assumed (Fig. 4b): 
1) Hsp70 not only binds extended peptides*!>!®5, but also directly 
binds (partially) folded structures. The Hsp70 lid is central and could 
adopt reported open conformations!>-!”” to accommodate and bind 
folded structures, though indirect action cannot be excluded, while 
the groove also conveys stability. 2) In the canonical model, protein 
chains fold autonomously after Hsp70 release, and can subsequently 
be transferred to other chaperones that act in later stages, such as 
Hsp90 and GroEL!"!°. Our findings suggest Hsp70 can also guide and 
organize late stages of folding’, for instance by limiting inter-domain 
contacts”, and may not fully release protein substrates until near native. 
3) Hsp70 can protect partially folded structures against aggregation 
in addition to extended peptide segments”®. 4) Hsp70 is known to 
destabilize (mis)folded protein structures*. Hsp70 could achieve this 
role by interacting directly with misfolded and aggregated structures. 
Furthermore, our data show that Hsp70 can also do the opposite, and 
stabilize partially folded protein structures. These contrasting effects 
may be controlled by ADP abundance and co-chaperones, which can 
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for instance modulate the lifetime of the ADP state. 5) These findings 
suggest that Hsp70 can preserve enzymatic functions during stress at 
low energy costs, in line with the reduced energy availability in episodes 
of stress. 

These Hsp70 functions could have important consequences for our 
understanding of normal cellular physiology as well as pathophysiology. 
Hsp70 may transfer clients to other chaperones*? without requiring 
(local) unfolding, and contribute to an evolutionary capacitor role by 
stabilizing substrate proteins”’. Co-factors such as HIP*” that promote 
the ADP state could have a role in regulating client stabilization. Finally, 
Hsp70 could control signalling pathways° by interacting directly with 
the folded parts of transcription factors, kinases, and receptors, in 
addition to their unfolded segments*”. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized and the investigators were not blinded to allocation during 
experiments and outcome assessment. 
Expression and purification of MBP and 4MBP. N-terminally biotinylated 
MBP and 4MBP C-terminally fused with 4 Myc tag sequences were produced 
in Escherichia coli as hybrid proteins consisting of an N-terminal Ulp1-cleavable 
N-terminal His;9-SUMO sequence followed by an AviTag sequence (Avidity, LCC, 
Aurora, Colorado, USA), facilitating in vivo biotinylation and four consecutive 
C-terminal Myc-tag sequences. Proteins were purified from E. coli BL21(DE3) 
cells harbouring pBirAcm encoding the biotin ligase (Avidity, LCC, Aurora, 
Colorado, USA). For overexpression overnight cultures were diluted 1:100 in 
fresh LB medium supplemented with 20 mg/l biotin, 20 mg/l kanamycin, 10 mg/l 
chloramphenicol, 0.2% glucose and incubated under vigorous shaking at 30°C. 
Expression was induced at OD¢09 = 0.6 by addition of 1mM IPTG for 3h. Cells 
were chilled, harvested by centrifugation, flash-frozen in liquid nitrogen and stored 
at —70°C. Cell pellets were resuspended in ice-cold buffer A (20mM Tris-HCl 
pH 7.5, 0.2M NaCl, 1% Triton X-100, 1mM PMSF) and lysed using a French 
pressure cell. The lysate was cleared from cell debris by centrifugation at 35,000g 
for 30 min and incubated with Ni-IDA matrix (Protino; Macherey-Nagel, Diiren, 
Germany) for 30 min at 4°C. The matrix was washed extensively with buffer A 
and bound hybrid proteins were eluted in buffer A containing 250 mM imidazole. 
The eluate was supplemented with His6-Ulp1 protease and dialysed overnight at 
4°C in buffer D (20 mM Tris-HCl pH 7.5, 0.2 M NaCl). Following dialysis coupled 
with Ulp1 digestion, His6-Ulp1 protease and the His,o-SUMO fragment were 
removed by incubation with Ni-IDA matrix. The unbound fraction containing 
cleaved MBP or 4MBP was then loaded on Amylose resin (New England Biolabs) 
previously equilibrated in buffer D, washed with cold buffer D and bound proteins 
were eluted in buffer D supplemented with 20 mM maltose. Elution fractions were 
dialysed three times for 2h at 4°C in 100-fold excess volume of buffer S$ (20 mM 
Tris-HCl pH 7.5, 0.2 M NaCl, 1mM EDTA). 4MBP purifications in addition were 
subjected to size-exclusion chromatography using a HiLoad 16/600 Superdex prep 
grade column. Purified proteins were concentrated using Vivaspin centrifugal 
concentrators, aliquoted, flash frozen in liquid nitrogen and stored at —70°C. 
Biotinylated proteins were also produced as follows. Purification of MBP and 
4MBP containing one N-terminal cysteine was performed as described”°. For 
biotinylation the purified proteins were dialysed in buffer B (20mM Tris-HCl 
pH 7.5, 0.2M NaCl, 1mM EDTA) and incubated on ice for 15 min in buffer B 
containing 5mM TCEP to reduce disulphide bonds. A tenfold molar excess of 
maleimide-PEG11-biotin (Thermo Scientific) dissolved in DMSO was added 
and biotinylation was performed for 2h at 25°C. Proteins were concentrated and 
subjected to size-exclusion chromatography using a HiLoad 16/600 Superdex prep 
grade column. Purified fractions were analysed by SDS-PAGE, pooled, aliquoted, 
flash frozen in liquid nitrogen and stored at —70°C. 
Wild-type and mutant chaperone expression and purification. All proteins were 
produced in E. coli from expression plasmids. Wild-type DnaK, DnaK(2-538) 
(lid-truncated), and DnaK-V436F (groove-mutated) were produced in a AdnaK 
strain and purified by ammonium sulphate precipitation, anion exchange chroma- 
tography (DEAE-sepharose), ATP-agarose affinity chromatography, gel-filtration 
and strong anion exchange chromatography (Resource Q) as described earlier*”. 
DnaJ was purified by chromatography on cation exchange and hydroxyapatite 
material. GrpE was expressed in a AdnaK strain and purified by chromatography 
on DEAE-Sepharose, hydroxyapatite and Superdex 200 as described* Tryptophan- 
free DnaK mutants (W102F) were expressed with an N-terminal His,o-SUMO 
sequence and purified by Ni-affinity chromatography, Ulp1 digestion and a second 
Ni-affinity chromatography step. 
Luciferase activity protection assay. Luciferase (80 nM in 40 mM HEPES/KOH 
pH 7.5, 50mM potassium acetate, 5mM magnesium acetate, 2mM DTT) was 
incubated in the absence or presence of ATP (2mM), ADP (3.241M), wild-type 
or mutant DnaK proteins (800nM), DnaJ (160nM) and GrpE (400 nM) at 37°C. 
Luciferase activity was determined by diluting 1 ,1l aliquots into 12411 assay buffer 
(100 mM potassium phosphate (pH 7.8), 25 mM glycylglycine (pH 7.4), 100 mM 
potassium acetate, 15 mM magnesium acetate, 5mM ATP), mixing with 1251 
luciferin (801M in assay buffer) and monitoring bioluminescence for 5s without 
delay using a Biolumat (Berthold). 
RepE54 thermal unfolding assay. RepE54 (21M in 50mM HEPES/KOH pH 7.5, 
50mM KCl, 5mM MgCl, 2mM DTT) was incubated at 25°C in the absence 
or presence of ATP (2mM), ADP (2mM), ADP loading buffer conditions 
(2mM ADP +5% ATP), tryptophan-free DnaK wild-type or mutant proteins 
(21M) for 30 min. The unfolding transition of RepE54 was then monitored by 
fluorescence emission at 329 nm (excitation at 295 nm) as a function of temperature 
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(temperature gradient from 25-65 °C at a rate of 90°C/hour) on a Jasco FP-6500 
spectrofluorometer. 

Optical tweezers assay. Anti-digoxigenin (anti-Dig) coated beads (DIGP-20-2, 
diameter 2.1 mm) were purchased from Spherotech and stored at 4°C until use. 
2,553-base-pair DNA handles were prepared and functionalized with digoxi- 
genin (Dig) and biotin at the 5’-ends of both strands as described previously”". 
DNA-coated microspheres were made by mixing ~70 ng of biotin-dsDNA-Dig 
and 411 Dig-coated beads in 10 ml HMK buffer (50 mM HEPES, pH 7.5, 5mM 
MgCl2, 100mM KC)). After a 30 min incubation on a rotary mixer (4°C), ll 
of neutravidin solution (1% w/v) was added to the mixture and incubated at 
4°C for 10 more minutes. Next, the unbound neutravidin was washed away by 
centrifugation, and the beads were dissolved in 400 il HMK buffer for use in optical 
tweezers experiments. Carboxylated polystyrene beads (CP-20-10, diameter 2.1 1m, 
Spherotech) were covalently attached to anti-Myc antibody (Roche Diagnostics) 
via carbodiimide reaction (Poly-Link Protein Coupling Kit, Polysciences Inc.). 
Briefly, the beads were washed and then mixed with freshly prepared 1-ethyl- 
3-(3-dimethylaminopropyl) carbodiimide and the antibody, and the mixture 
was incubated for 3h. MBP-coated beads were made by mixing 511 of 501M 
solution of either 1MBP or 4MBP and 211 Myc-coated beads in 10 pl HMK buffer 
and incubating the mixture for 30 min on a rotary mixer at 4°C. Subsequently, 
the beads were dissolved in 4001] HMK buffer for use in optical tweezers 
experiments. 

Optical tweezers assays, buffer contents: Fig. 1b, c, no chaperones; Fig. le, f, 
100nM Dnak, 100nM DnajJ, 50nM GrpE, 1 mM ATP; Fig. 2b, c, no chaperones; 
Fig. 2d-g, 11M DnaK, 1mM ADP loading buffer; Fig. 2h, i, 100 nM DnaK, 1mM 
ATP; Fig. 3a, b, 1j1M Dnak lid-truncated, 1mM ADP loading buffer; Fig. 3c, d, 
141M DnaK groove-mutated, 1 mM ADP loading buffer; Extended Data Fig. 5b, 
100nM DnaK, 1mM ADP loading buffer; Extended Data Fig. 5c-f, 1}1M DnaK, 
1mM ADP loading buffer; Extended Data Fig. 5g, 100nM DnaK, 1mM ADP load- 
ing buffer; Extended Data Fig. 5h, i, 11M DnaK, 100M ADP purified by HPLC; 
Extended Data Fig. 5j, k, 100nM DnaK T199A, 1mM ATP; Extended Data Fig. 51, 
1,.M lid-truncated DnaK and 1mM ADP loading buffer; Extended Data Fig. 7b, 
100nM Dnak, 1 mM ATP; Extended Data Fig. 7c, 100nM Dnak lid-truncated, 
1mM ATP; Extended Data Fig. 7d, 100nM DnaK groove-mutated, 1mM ATP; 
Extended Data Fig. 8b, 1 tM DnaJ; Extended Data Fig. 8c, 100nM DnaJ; Extended 
Data Fig. 8d, 11M GrpE. Number of events (high force): n = 38 (Fig. 1b, c), 
55 (Fig. le, f), 52 (Extended Data Fig. 7b), 131 (Extended Data Fig. 7c), 
96 (Extended Data Fig. 7d). Number of events (low force): n = 50 (Fig. 2b, c), 
35 (Fig. 2d, e), 15 (Fig. 2f, g), 24 (Fig. 2h, i), 15 (Fig. 3a, b), 21 (Fig. 3c, d), 
6 (Extended Data Fig. 5b), 10 (Extended Data Fig. 5c, d), 11 (Extended Data 
Fig. 5e, f), 17 (Extended Data Fig. 5g, h), 15 (Extended Data Fig. 5i, j), 4 (Extended 
Data Fig. 8b), 9 (Extended Data Fig. 8c), 10 (Extended Data Fig. 8d). ADP loading 
buffer was prepared by dissolving ADP sodium salt (A2754, Sigma Aldrich) in 
HMK buffer. The ATP fraction in this buffer was ~5%, as measured by HPLC. 

Stretching experiments were performed at room temperature using a custom 
made optical tweezers setup!*. An MBP-coated bead was trapped in the optical 
trap and then transferred to a micropipette tip, and, subsequently, a DNA-coated 
bead was trapped. Next, two beads were brought in close contact, allowing a tether 
between the beads to form. When the micropipette is moved away from the trap, 
the tether experiences a stretching force, leading to its extension and deflection of 
the trapped bead from the trap centre. Specific tethering to natively folded MBP 
molecules was detected by the typical two-step unfolding upon stretching. Tethers 
that broke at low force or showed no unfolding, for instance because of non- 
specific binding of the neutravidin at the end of the DNA tether to the bead surface, 
were excluded. The deflection is monitored by collecting the transmitted laser light 
and directing it to quadrant photodiode where it is recorded at 50 Hz. The data 
are filtered with 5th order Butterworth filter at 20 Hz before saving. Trap stiffness 
and sensitivity were measured to be 169 + 24 pN/m and 2.74 + 0.24 V/pm 
correspondingly. A nanopositioning piezo stage is used to move the flow chamber 
and micropipette at a speed of 50nm/s, which corresponds to a pulling rate on 
the tethered MBP construct of ~5 pN/s at unfolding. Differences between pie 
charts were analysed using one-tailed two-proportion z-test. Statistical analysis 
of sustained forces in different conditions was performed using the Kruskal- 
Wallis (one-way ANOVA on ranks) test, given that the forces are not distributed 
normally and have non-equal variances for different groups. Test results such 
as P values are reported in the main text. In box plots, whiskers indicate 90% 
and 10% extreme values, within the box the band indicates the median, rectangle 
indicates the mean. In Fig. 3g, error bars are +1 s.d. In Fig. 3h, error bars 
are +1 s.e.m. 

Data availability. The data that support the findings of this study are available 
from the authors on reasonable request. 
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Extended Data Figure 1 | Domain architectures of DnaK, DnaJ and 
GrpE. a, Structure of E. coli DnaK bound to peptide substrate and ADP 
nucleotide (PDB ID: 2KHO) in N-to-C-terminal colour gradient from red 
(N terminus) through dark grey to blue (C terminus)**. b, DnaJ domain 
structure containing J domain, glycine/phenylalanine-rich domain (G/F), 
cysteine-rich region/zinc-finger (CRR) and C-terminal domain; partial 
NMR structures of J domain (PDB ID: 1XBL) and crystal structure 
comprising cysteine-rich domain (CRR) and peptide-binding domain of 
the yeast homologue with co-crystalized peptide substrate highlighted in 
green spheres (PDB ID: INLT)**"°. c, GrpE structure (PDB ID: 1DKG)**. 
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Extended Data Figure 2 | Canonical nucleotide-driven cycle of 
Hsp70-client interactions. Hsp70 is known to interact with a large 

range of protein conformations from unfolded nascent chains to near- 
native proteins to aggregates and misfolded states. Non-native protein 
conformers are captured via short extended peptide segments by the 
substrate-binding groove of Hsp70. Hsp40 (DnaJ) then accelerates the ATP 
hydrolysis reaction together with the bound substrate. Hsp70-ADP locks 
the substrate under the closed helical lid of the substrate-binding domain 
in the ‘high-affinity’ state. The lid could also adopt the open conformation 
according to recent indications'®'*°. Nucleotide exchange factor (GrpE) 
subsequently accelerates the release of ADP and either spontaneous 
fluctuations of the lid or new ATP molecules facilitate the release of the 
substrate to regenerate Hsp70-ATP for another round of the cycle*”. 
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DnakK-ADP 


DnakK-ATP 


Extended Data Figure 3 | Protein structures. a, DnaK in ADP state 
(PDB ID: 2KHO). Domain boundaries indicated in different colours: 
nucleotide-binding domain (purple, 1-383), hydrophobic linker (green, 
384-396), 3-sheet subdomain of substrate-binding domain with groove 
(orange, 397-502), a-helical lid subdomain of substrate binding domain 
(yellow, 503-602)**. b, DnaK in ATP bound open conformation (PDB ID: 
4B9Q). c, Apo MBP (PDB ID: 2MV0). Proteins are displayed in the same 
scale to aid visualizing interactions between them. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


Extended Data Figure 4 | Predicted peptides binding to DnaK for representations of predicted peptides are restricted to their backbone 
MBP, luciferase and RepE54. a, Peptide-library-trained predictions of atoms as their accessibility is central to canonical peptide binding of 
Dnak-interacting peptide segments in the unfolded chain of MBP mapped _DnaK. b, Idem for MBP core structure. c, Idem for firefly luciferase 
in orange, using the algorithm introduced by Riidiger et al.!* (PDB ID: (PDB ID: 1LCI) d, Idem for RepE54 (PDB ID: 1REP) including DNA 
1ANF). N terminus marked green, C terminus marked blue. Surface ligand. 
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Extended Data Figure 5 | Stabilization of folded structures by DnaK. 

a, Schematic diagram of the setup. DnaK and ADP were present in 

the experimental buffer from the start of the experiment. b, In the first 
stretching curve (blue), only the C-terminal fragment of MBP was 
unfolded, and then force was immediately reduced to low force preventing 
further unfolding. After 3 min waiting at low force, subsequent stretching 
showed resistance to forced unfolding up to 50 pN (orange). Experiments 
performed in 100nM DnaK with 1mM ADP loading buffer. c, Force 
acting on MBP is plotted versus time. Pulls (single stretching—-relaxation 
cycles) on the same MBP molecule are followed by increasing waiting 
periods at 0 pN, in the presence of 111M DnaK and 1mM ADP loading 
buffer. d, Force-extension curves of MBP indicate increasing protection of 
partial folds against force (corresponding to panel c). Earlier pulls are in 
blue, later in red. e, Force on MBP versus time. Identical buffer conditions 
as panels c and d. f, Force-extension curves of MBP indicate the 
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Extension (um) 
stabilization of compact state of MBP (corresponding to panel e). First pull 
is highlighted in blue, subsequent pulls in red. g, Force-extension curves 
of MBP in the presence of 100nM DnaK and 1 mM ADP loading buffer. 
Blue, first stretching curves on different MBP molecules; orange and red, 
stretching curves after refolding and 3 min waiting at low force. h, i, MBP 
stretching and relaxation experiments in the presence of 141M DnaK 
with 100|.M ADP purified by high-performance liquid chromatography 
(HPLC) were present. First stretching curves are shown in blue, stretching 
curves denoted in shades of red were acquired after 3 min waiting at low 
force. j, MBP stretching and relaxation in the presence of 100nM DnaK 
mutant T199A and 1 mM ATP, which traps DnaK in the ATP state. 

k, Corresponding refolding probability (n = 18). 1, Force-extension curves 
of MBP in the presence of 1 1M lid-truncated DnaK and 1mM ADP 
loading buffer, after refolding and 3 min waiting at low force (n= 13). 
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Extended Data Figure 6 | DnaK mutant structures. a, DnaK(V436F) 
(termed groove-mutated) structure model highlighting in red spheres the 
location of the point mutation hindering access to its groove, resulting 

in a 38-fold reduction in peptide-binding affinity** (V436F substitution 
modelled onto PDB ID: 2KHO). b, DnaK(2-538) (termed lid-truncated) 
structure (lid deletion modelled using 2KHO). The part of the lid that is 
still present in DnaK(2-538) interacts with the nucleotide binding domain 
in the ATP-bound open conformation, which might be important for the 
mechanics of DnaK. 
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Extended Data Figure 7 | 4MBP refolding in the presence of ATP and wild-type or mutant DnaK. a, Diagram of the experiment. b-d, Stretching 
curves of 4MBP in the presence of ATP and wild-type DnaK (b), lid-truncated DnaK (c), and groove-mutated DnaK (d). 
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Extended Data Figure 8 | MBP interaction with co-chaperone DnaJ and nucleotide exchange factor GrpE. a, Diagram of the 4MBP experiments. 


b, Stretching curves of 4MBP with DnaJ. c, Diagram of the 1MBP experiments. d, 1MBP stretching in the presence of DnaJ. e, 1MBP stretching in the 
presence of GrpE. 
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Extended Data Figure 9 | DnaK increases thermal stability of RepE54. 
a-f, Thermal denaturation curves of RepE54 as measured by tryptophan 
fluorescence in the absence (a) or presence (b, c, f) of tryptophan-free 
DnaK(W102F) with ADP loading buffer (b), ADP (c), ATP (f) or 
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lid-truncated (d) or groove-mutated (e) tryptophan-free DnaK with ADP 
loading buffer. Vertical lines mark the apparent melting points of RepE54 
only (blue) and Dnak-stabilized RepE54 (red). Error bars indicate the 
standard error of the mean over three replicates. 
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Extended Data Figure 10 | DnaK preserves enzyme activity. Bulk 
luciferase protection functional assay monitoring luciferase activity 
at 37°C in the absence or presence of nucleotides and chaperones as 
indicated. The active fraction of luciferase after 45 min from the start 
of temperature upshift from 0 °C to 37°C is shown. The experiment is 
performed in triplicates; error bars indicate the standard error of the 
mean. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


doi:10.1038/nature20149 


Local regulation of gene expression by IncRNA 
promoters, transcription and splicing 


Jesse M. Engreitz!*, Jenna E. Haines'+, Elizabeth M. Perez!, Glen Munson!, Jenny Chen!?, Michael Kane!, Patrick E. McDonel't, 


Mitchell Guttman? & Eric S. Lander!*° 


Mammalian genomes are pervasively transcribed’? to produce 
thousands of long non-coding RNAs (IncRNAs)*”". A few of these 
IncRNAs have been shown to recruit regulatory complexes through 
RNA-protein interactions to influence the expression of nearby 
genes” ’, and it has been suggested that many other IncRNAs can 
also act as local regulators®’. Such local functions could explain 
the observation that IncRNA expression is often correlated with 
the expression of nearby genes!*!!, However, these correlations 
have been challenging to dissect!” and could alternatively result 
from processes that are not mediated by the IncRNA transcripts 
themselves. For example, some gene promoters have been proposed 
to have dual functions as enhancers!3~!°, and the process of 
transcription itself may contribute to gene regulation by recruiting 
activating factors or remodelling nucleosomes'”!”!*. Here we use 
genetic manipulation in mouse cell lines to dissect 12 genomic loci 
that produce IncRNAs and find that 5 of these loci influence the 
expression of a neighbouring gene in cis. Notably, none of these 
effects requires the specific ncRNA transcripts themselves and 
instead involves general processes associated with their production, 
including enhancer-like activity of gene promoters, the process of 
transcription, and the splicing of the transcript. Furthermore, such 
effects are not limited to IncRNA loci: we find that four out of six 
protein-coding loci also influence the expression of a neighbour. 
These results demonstrate that cross-talk among neighbouring genes 
is a prevalent phenomenon that can involve multiple mechanisms 
and cis-regulatory signals, including a role for RNA splice sites. 
These mechanisms may explain the function and evolution of some 
genomic loci that produce IncRNAs and broadly contribute to the 
regulation of both coding and non-coding genes. 

We analysed 12 IncRNA loci whose RNA transcripts in mouse 
embryonic stem cells (mES cells) show preferential localization to the 
nucleus and span a range of abundance levels (Methods and Extended 
Data Fig. 1). For each locus, we looked for direct regulatory effects on 
local gene expression by using a genetic approach based on classical 
cis-trans tests (Fig. la and Supplementary Note 1). Specifically, 
we generated clonal cell lines carrying heterozygous knockouts of 
the promoter (~600-1,000-bp deletions) (Fig. 1b) and compared the 
expression of nearby genes within 1 Mb on the cis and trans alleles (that 
is, on the modified and unmodified homologous chromosomes in the 
same cells) (Supplementary Note 2). Changes in neighbouring gene 
expression that involve only the cis allele very probably result from 
direct, local functions of the IncRNA locus, while changes that involve 
both the cis and trans alleles probably result as indirect, downstream 
consequences of the IncRNA acting elsewhere (Supplementary Note 1). 
We performed genetic modifications in 129/castaneus F1 hybrid mES 
cells that contain a polymorphic site every ~140 bp, enabling us to dis- 
tinguish the two alleles using RNA sequencing (Fig. 1b, Extended Data 
Fig. 2 and Supplementary Note 3). 


At 5 of these 12 IncRNA loci, promoter knockouts significantly 
affected the expression of a nearby gene in an allele-specific manner 
(false discovery rate <10%), including both activating and repressive 
effects (Fig. 1c, d, Supplementary Note 4 and Extended Data Fig. 3). 
For each locus, the affected gene was located immediately adjacent to, 
and within 5-71 kb of, the knocked-out promoter (Fig. 1c and Extended 
Data Fig. 4). This indicates that a substantial fraction of IncRNA loci 
influence the expression of a neighbouring gene. 

To test whether such effects were specific to IncRNA loci, we 
deleted the promoters of six protein-coding genes (Extended Data 
Fig. 1). Surprisingly, knockouts at four of these loci also affected the 
expression of a neighbour in cis (Fig. 1c, d and Extended Data Fig. 5). 
Thus, both non-coding and coding loci can directly influence local 
gene expression. These regulatory connections may contribute to the 
observed correlations in the expression of neighbouring genes, which 
have been reported both for IncRNAs and for mRNAs!™!119.20, 

Because in these experiments we deleted gene promoters, the 
mechanisms underlying such cis effects could in principle involve 
(i) DNA regulatory elements in gene promoters!*~"*; (ii) the process 
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Figure 1 | Many IncRNA and mRNA loci influence the expression of 
neighbouring genes. a, Knocking out a promoter (black) could affect a 
neighbouring gene (blue) directly (local) or indirectly (downstream). 

b, Knockout of the linc 1536 promoter. Left, genotypes; right, allele-specific 
RNA expression for 129 and castaneus (Cast) alleles normalized to 81 
control clones (+/+). Error bars, 95% confidence interval for the mean 
(n=2 for —/—, 3 for +/—, 1 for —/+). c, Gene neighbourhoods oriented 
so each knocked-out gene (black) is transcribed in the positive direction. 
Blue neighbouring genes show allele-specific changes in expression. 

+See Supplementary Note 3. d, Average RNA expression on promoter 
knockout compared to wild-type alleles (n > 2 alleles, see Supplementary 
Table 1). *FDR < 10%; ***FDR < 0.1%. 
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of transcription!®!”'; or (iii) the RNA transcripts themselves*° 


(Extended Data Fig. 6a). To begin to distinguish among these possible 
mechanisms, we inserted early polyadenylation signals (pAS) 0.5-3 kb 
downstream of each transcription start site (TSS) that eliminated the 
production of most of the RNA while leaving the promoter sequence 
intact (Fig. 2 and Extended Data Fig. 6b, c, see Methods). We examined 
four IncRNA loci and two mRNA loci where promoter deletion affected 
the expression of a neighbouring gene (see Supplementary Note 5). 

As one example, we describe the linc1536 locus, hereafter called 
Bendr (Bend4-regulating effects not dependent on the RNA, Fig. 2a). 
Whereas deleting the Bendr promoter reduced the expression of the 
adjacent Bend4 gene by 57%, inserting a pAS into the first intron of 
Bendr (~570 bp downstream of the TSS in this ~13 kb locus) had no 
effect on Bend4 expression despite eliminating the spliced Bendr RNA 
(Fig. 2b, c). Furthermore, global run-on sequencing (GRO-seq) did not 
detect any transcriptionally engaged polymerase upstream of the pAS 
insertion (Fig. 2c and Extended Data Fig. 7a), perhaps because the pAS 
prevents RNA splicing, which may substantially reduce transcriptional 
activity in the modified locus!?. Therefore, cis activation of Bend4 
requires neither the mature Bendr RNA transcript nor significant Bendr 
transcription. Instead, this effect appears to be mediated by DNA regu- 
latory elements in the ~750 bp knocked-out promoter-proximal region. 

In total, at five of the six loci examined with pAS insertions (including 
three IncRNAs and two mRNAs), DNA regulatory elements in the 
promoter-proximal sequences appear to be responsible for activating a 
neighbouring gene (Extended Data Fig. 7b). Although the promoters 
in these loci would not be classified as ‘enhancers’ based on H3K4me3/ 
H3K4mel ratios”, they are bound by mES cell transcription factors 
(Extended Data Fig. 7c) and are located in close proximity to their 
neighbouring target genes (Fig. lc and Extended Data Fig. 7d, e), 
suggesting that these promoters may affect local gene expression 
through mechanisms similar or identical to enhancers'>*”». 

We also identified one locus, linc1319 (renamed Blustr: bivalent 
locus (Sfmbt2) is upregulated by the splicing and transcription of an 
RNA), where both promoter deletions and pAS insertions substan- 
tially reduced the expression of a neighbouring gene, Sfmbt2, located 
5kb upstream (Fig. 3a). To dissect the regulatory mechanism, we 
tested whether the activation of Sfmbt2 is mediated by a sequence- 
specific function of the Blustr transcript or the process of transcription 
(by which we mean one or more sequence-independent functions 
associated with transcription, such as changes in chromatin state or 
recruitment of co-factors). To test the first possibility, we knocked 
out each of the three downstream exons and three introns. None of 
these deletions impaired Sfmbt2 activation (Fig. 3b, Supplementary 
Note 6), suggesting that the activation of Sfmbt2 does not require 
unique sequences or structures in the Blustr transcript itself. To test 
the second possibility, we engineered pAS insertions at five different 
locations in the first exon or intron (+40 bp to +15 kb downstream of 
the TSS) and found that increasing the length of the Blustr transcribed 
region led to increased activation of Sfmbi2 (Fig. 3b and Extended Data 
Fig. 8a, b). We note that changing the length of the transcribed region 
affected the total amount of engaged polymerase in the Blustr locus 
(Fig. 3c). Thus, Sfmbt2 activation responds to changes in the length/ 
amount of transcriptional activity in the Blustr locus but does not 
appear to require specific sequence elements in the mature Blustr 
transcript (Supplementary Note 7). 

Because promoter-proximal splice sites and the process of splicing 
can enhance transcription—in some cases by as much as 100-fold???— 
we tested whether the splicing of Blustr is involved in Sfmbt2 activation. 
Upon deleting the 5’ splice site of the first intron of Blustr (Extended 
Data Fig. 8c), we observed a 94% reduction in Blustr transcription 
(as assayed by GRO-seq), a 92% reduction in the levels of the mature 
Blustr transcript, and an 85% reduction in Sfmbt2 expression (Fig. 3b, c 
and Extended Data Fig. 8a, b), demonstrating that the first 5! splice site 
of Blustr has a critical role in activating Blustr and Sfmbt2 transcription. 
By contrast, downstream splice sites were dispensable: upon deleting 
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a, Transcriptionally engaged RNA polymerase (GRO-seq) and H3K4me3 
occupancy (chromatin-immunoprecipitation followed by sequencing, 
ChIP-seq). b, poly(A)* RNA expression upon deleting the Bendr promoter 
or inserting a pAS on modified alleles versus controls. Error bars, 95% 
confidence interval for the mean (n > 2 alleles, see Supplementary Table 1). 
c, Allele-specific GRO-seq signal for clones carrying the indicated 
modifications. Both clones are modified on the 129 allele, and only reads 
specifically mapping that allele are shown. The y axis shows normalized 
read count. Bar plot quantifies signal at Bend4, including seven additional 
wild-type controls not shown on left. 
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downstream Blustr exons, splicing skipped over the removed exon to 
the next available 3’ splice site (Extended Data Fig. 8d) and Sfmbt2 
expression was unaffected (Fig. 3b). 

Together, these data demonstrate that the 5! splice site and the 
process of transcription in the Blustr locus are important for its ability 
to regulate Sfmbt2. This indicates that the Blustr RNA is in fact required 
for Sfmbt2 activation (splicing involves direct interactions between the 
spliceosome and the nascent transcript), although this mechanism does 
not appear to depend on the precise sequence of the RNA beyond the 
presence of initial splice signals. One possibility is that the 5’ splice 
site promotes transcriptional activity in the Blustr locus, which in turn 
recruits components of the transcriptional machinery that act on the 
nearby Sfmbt2 promoter (Fig. 3d, Supplementary Note 7). Consistent 
with this model, altering transcription or splicing in the Blustr locus 
led to changes in chromatin state at the Sfmbt2 promoter (including 
reductions in H3K4me3 and spreading of H3K27me3) and reduced 
occupancy of engaged RNA polymerase in the paused position just 
downstream of the Sfmbt2 TSS (Extended Data Fig. 8b, e, f). Thus, 
changes in Blustr transcription and splicing may affect Sfmbt2 expres- 
sion in part by altering chromatin state and RNA polymerase occu- 
pancy at the Sfmbt2 promoter (Fig. 3d and Supplementary Note 7). 

In summary, genetic dissection of 12 IncRNA loci and 6 mRNA 
loci found that 9 loci (50%) regulate the expression of a neighbouring 
gene (Extended Data Fig. 9). In most of these loci, including Bendr, 
local effects are mediated by enhancer-like functions of DNA elements 
in promoters. In one locus, Blustr, the processes of transcription 
and splicing also contribute to cis-regulatory functions, perhaps by 
increasing the local concentration of transcription-associated factors. 
We did not identify any IncRNA loci in which local effects are mediated 
by sequence-specific functions of the IncRNA transcript. Because there 
exist thousands of other loci that fit our selection criteria, we expect 
that similar mechanisms broadly contribute to gene regulation in many 
loci (Supplementary Note 8). 

The frequent cross-talk between neighbouring genes observed in 
our study indicates that gene loci can encode multiple independent 
categories of functions. Category I involves functions of the RNA 
product: mRNAs provide a template for protein synthesis, and 
some non-coding transcripts (for example, XIST) act as functional 
IncRNAs. Category II involves the effects of transcription-related 


17 NOVEMBER 2016 | VOL 539 | NATURE | 453 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


a linc1319/Blustr Sfmbt2 
PNA PO he ey lh a tae Boe ees Lele 
aistrand : 7 = cl tle ak te th te pete Rea oe te 
GRO-seq aa oy ee Seen 
H3K4me3_ | So ee ere re ee a 
14 tle osu EF | }-4 
b RNA ies esas Blustr Sfmbt2 
ee 0% 100% 200% 0% 100% 200% 
Blustr ; : ; ; 
rreool 1 Wild-type [i = 
a +f Promoter deletion } fr 
on ee Lh I Deletion: Exon 2 
i Af I Exon 3 
1 1 
Aa | | Exon 4 \ ye 7 4,502% 
DION fsseeteeeenaneeeeenneneeeeseneeay A Intron 1 4 
Looper | Intron 2 —_ = 
ar \ | Intron 3 ; 
| e pAS: +40 bp _ ay 
mi e +0.5 kb il 
| 1 ee! +2 kb 
Tl e +4kb 
ti | e | +15 kb | | 
1 1 
| I 5’ splice site ! 4 1 
1 1 
Cc Allele-specific GRO-seq d @ 
ail, me = RNA polymerase II 
‘ii cc, ili, Wild-type ) © chromatin regulators 


Promoter deletion 


pAS: +2 kb 


i= s . A 
a 
+15 kb 


5’ splice site 


Figure 3 | Transcription and splicing of Blustr activates Sfmbt2 
expression. a, Poly(A)* RNA-seq, GRO-seq, and H3K4me3 ChIP-seq 

in the Blustr locus. Sfmbt2 has two alternative TSSs. b, Poly(A)* RNA 
expression on modified alleles compared to controls (arrows). Error bars, 
95% confidence interval for the mean (n > 2 alleles, except for pAS 


processes—including mechanisms mediated by promoters, 
transcription, and splicing—on the regulation of other nearby genes. 

The fact that many IncRNA loci have category II functions does not 
necessarily mean that they do not also have category I functions, and we 
note that our experiments do not rule out the possibility that the IncRNAs 
dissected in this study have RNA-mediated functions other than on 
local gene regulation. However, the prevalence of category II func- 
tions suggests a model for the evolutionary origins of some IncRNAs. 
In loci where a promoter acts as an enhancer, RNA transcripts may 
arise as non-functional by-products'®. In loci where co-transcriptional 
processes have cis-regulatory functions, the nascent transcripts might 
contribute through mechanisms such as splicing that require little 
RNA-sequence specificity. These possibilities are particularly intriguing 
in light of the patterns of evolutionary conservation of IncRNA loci”**. 
For example, although most IncRNA transcripts expressed in mES cells 
are not conserved (no RNA detected in syntenic loci in other mammals, 
see Methods), the promoters in some of these loci correspond to 
conserved DNA sequences that have an enhancer chromatin signature 
in human ES cells (Fig. 4, Extended Data Fig. 10 and Supplementary 
Note 9). These sequences may have conserved functional roles as 
cis-regulatory elements, rather than as IncRNA promoters. Thus, 
mechanisms associated with cis functions by promoters, transcription, 
and/or RNA processing may contribute to the functions and evolution 
of an important subset of non-coding loci in mammalian genomes 
(Extended Data Fig. 10c). 

Beyond the implications for IncRNAs, these cis-regulatory connec- 
tions between neighbouring genes occur in both protein-coding and 
non-coding loci and thus appear to represent a fundamental property 
of mammalian gene regulatory networks. The properties of these 
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+15 kb where n= 1, see Supplementary Table 1). Sfmbt2 pAS comparisons: 
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for clones carrying indicated modifications. Only reads mapping to the 
modified allele are shown (castaneus for pAS +2 kb; 129 for others). 

d, Model for how transcription in the Blustr locus activates Sfmbt2. 
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Figure 4 | Evolutionary conservation of mES cell IncRNAs and their 
promoters. a, Classification of a subset of IncRNAs expressed in mES cells 
(see Supplementary Note 9, Methods). b, Eleven IncRNAs have promoters 
whose syntenic sequence corresponds to putative DNA regulatory 
elements (REs) marked by DNase I hypersensitivity (HS) in human 

ES cells. c, Example: linc1494. d, Enhancers and IncRNA promoters are 
significantly enriched for corresponding to human regulatory elements 
(pie chart, ***P <1 x 10~!°, y? test versus GC-matched random regions) 
and show elevated sequence conservation compared to GC-matched 
regions (bar plot, **P < 0.01; ***P < 0.001, Mann-Whitney U-test 

versus ii-+ iii). 
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cis-regulatory connections—including mechanisms for specificity and 
the potential for cooperative dynamics of gene activation—represent 
key areas for future investigation. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Cell lines and cell culture. F1 hybrid 129/castaneus female mouse embryonic 
stem cells (gift from K. Plath) or V6.5 male mouse embryonic stem cells (gift from 
A. Meissner) were cultured in serum-free N2B27-based medium (250 ml 
neurobasal media (Gibco), 250 ml DMEM/F12 (Gibco), 5ml 100 x N2 supplement 
(Gibco), 5 ml 50x B27 supplement (Gibco), 5 ml 200 mM L-glutamine (Gibco), 
3.6 ul 2-mercaptoethanol, 501g human leukaemia initiation factor (5 x 10° 
units, EMD Millipore), 7.4\1g progesterone, 10 mg bovine insulin (Sigma), 350 il 
7.5% BSA fraction V (Gibco), supplemented with MEK inhibitor PD0325901 
(501 10 mM, SelleckChem), and GSK3b inhibitor CHIR99021 (15011 10mM, 
SelleckChem)). Prior to plating cells, tissue culture dishes were pre-treated with 
PBS + 0.2% gelatin (Sigma) and 1.75 xg ml! laminin (Sigma) for 2-10 h at 37°C. 
At each passage, cells were trypsinized for 3-5 min in TVP solution (0.025% 
trypsin, 1% chicken serum (Sigma), and 1 mM EDTA in PBS pH 7.4) at room 
temperature. Cells tested negative for mycoplasma contamination and were authen- 
ticated by comparing polymorphisms to 129S1 and castaneus genomes. 
Cellular fractionation. To estimate the relative abundance of IncRNAs in 
different cellular compartments, we performed cellular fractionation to isolate 
chromatin-associated, soluble nuclear, and cytoplasmic fractions essentially as 
described”’. In brief, we first lysed 5 million cells in 200,11 cold cell lysis buffer 
(10 mM Tris-HCl pH 7.5, 0.05% IGEPAL CA-630, 150 mM NaCl), incubating 
on ice for 5 min. We layered the cell lysate over 2.5 volumes of chilled sucrose 
cushion (24% sucrose in cell lysis buffer) and centrifuged at 15,000g for 10 min. 
The supernatant from this spin became the cytoplasmic fraction. After washing 
the pellet of nuclei with PBS (pH 7.5) +1mM EDTA, we resuspended the pellet 
in 1001] of cold glycerol buffer (20 mM Tris-HCl pH 7.5, 75mM NaCl, 0.5mM 
EDTA, 0.85mM DTT, 0.125mM PMSE, 50% glycerol) by gently flicking the tube. 
We added 100 11 of cold nuclei lysis buffer (10 mM HEPES pH 7.5, 1mM DTT, 
7.5 MgClo, 0.2mM EDTA, 0.3 M NaCl, 1 M urea, 1% IGEPAL CA-630), then vor- 
texed for four seconds. After 2 min on ice, we spun the nuclear lysate at 15,000g for 
2min. This supernatant was collected as the soluble nuclear (nucleoplasm) fraction. 
We rinsed the remaining pellet (chromatin fraction) in PBS +1mM EDTA, then 
resuspended the chromatin in 300,11 chromatin DNase buffer (20 mM Tris-HCl 
pH 7.5, 50mM KCl, 4mM MgCh, 0.5mM CaCh, 2mM TCEP, 0.5mM PMSF, 
0.4% sodium deoxycholate, 1% IGEPAL CA-630, 0.1% N-lauroylsarcosine) plus 
15] murine RNase inhibitor (NEB) and 30x] TURBO DNase (Ambion). The 
DNase digestion proceeded for 20 min at 37 °C and was halted by adding 10mM 
EDTA and 5mM EGTA. Protein was digested with proteinase K for 1h at 37°C. 
RNA was isolated using Zymo RNA Concentrator-25 columns (two columns for 
the cytoplasmic fraction). With this method, nuclear-associated endoplasmic 
reticulum is known to fractionate with the nucleoplasm”’, and we observed that 
nucleolar RNAs fractionated with chromatin (data not shown). From each cellular 
fraction, we sequenced total RNA and polyadenylated RNA (selected using oligo 
d(T)25 magnetic beads, NEB) using a strand-specific RNA-sequencing protocol 
for Illumina instruments described previously”. 

Selection criteria for knocked-out IncRNAs. We selected IncRNA loci initially 
identified and defined by a chromatin signature of H3K4me3 at promoters and 
H3K36me3 through gene bodies*. We further required that IncRNAs selected 
for knockout analysis have TSSs, as defined by cap analysis of gene expression 
(CAGE), located >5 kb from other genes (for epigenomic annotation of each locus, 
see http://pubs.broadinstitute.org/neighboring-genes/). To prioritize intergenic 
IncRNA loci that may regulate local gene expression, we focused on IncRNAs that 
have subcellular localization biased towards the nucleus versus the cytoplasm 
(Extended Data Fig. 1). We performed cellular fractionation experiments in V6.5 
male mES cells as described above and sequenced RNA from chromatin-associated, 
soluble nuclear, and cytoplasmic fractions (GEO Accession GSE80262). We 
calculated a relative nuclear-to-cytoplasmic ratio (chromatin RPKM plus soluble 
nuclear RPKM divided by cytoplasmic RPKM) and focused on IncRNAs with ratios 
above the median (1.5): these IncRNAs are preferentially localized to the nucleus 
compared to other IncRNAs and mRNAs. We selected nuclear-biased IncRNAs 
that span a range of abundance levels (Extended Data Fig. 1). We also included 
some IncRNAs that are conserved across mammalian evolution (Snhg3, Snhg17, 
Meg3, and linc2025). 

Selection criteria for knocked out mRNAs. We selected six mRNAs for promoter 
knockouts based on the following criteria. We knocked out two mRNAs that are 
moderately expressed and are not expected to be essential for mES cell growth 
(Dicer1 and CrIf3). We knocked out two mRNAs that are located adjacent to 
knocked-out IncRNAs (Sfmbt2 and Rcc1), in order to look for reciprocal regulatory 
effects between the IncRNA and the affected mRNA. We knocked out two mRNAs 
that are located adjacent to a gene that is itself adjacent to a ncRNA (Gpr19 and 
Slc30a9), in order to determine whether affected genes are specifically responsive 
to IncRNA promoters or are generally responsive to other promoters in the locus. 


Similar to the IncRNAs selected, the TSSs of these selected mRNAs are located 
>5kb from other genes. 

CRISPR sgRNA design. To design single-guide RNAs (sgRNAs), we built custom 
software to calculate a specificity score (based on potential off-target sites using 
the algorithm described at http://crispr.mit.edu (see ref. 31)) and an efficacy score 
(based on a sequence model for sgRNA efficiency as previously described*”) for 
each 20-nt targeting sequence. We removed guides with specificity scores <20 or 
efficacy scores >0.7. To avoid T-rich sequences that result in premature termina- 
tion of Pol III-mediated sgRNA transcription, we removed guides with more than 
one T in the four bases closest to the seed region, guides with more than three 
consecutive Ts, and guides with more than eight Ts total. We removed guides 
with homopolymer stretches of five or more bases and guides with GC content 
<20% or >90%. We removed guides that overlapped a known 129/castaneus 
SNP**. Within a given region, we typically chose the three remaining guides with 
the highest specificity scores. The sequences of all sgRNAs used in this study are 
listed in Supplementary Table 2. 

Promoter deletion guide placement. To knock out a ncRNA or mRNA promoter, 
we chose 2-3 sgRNAs located in windows 300-500 bp upstream and downstream 
of the TSS, leading to deletions of approximately 600-1,000 bp surrounding 
the TSS. We adjusted the precise deletion boundaries outward if we could not 
successfully design guides in these regions (for example, because they were 
located in repetitive sequences). We note that we often found that the wild-type 
alleles in heterozygous knockouts were affected by scars from repair of ssRNA 
double-stranded breaks. Accordingly, we adjusted the bounds if necessary to 
cut outside of the exons of the mRNA or IncRNA and thus avoid damaging the 
exonic sequences on the wild-type alleles in heterozygous knockouts. We note that 
the presence of these scars (and their lack of allele-specific effects on the expres- 
sion of neighbouring genes) indicate that the cis effects observed upon deleting 
promoters are not merely a result of CRISPR-mediated cutting and subsequent 
DNA repair. 

Genetic deletions with CRISPR/Cas9. To delete specific sequences, we co- 
transfected 100 ng of Cas9-expressing plasmids (PX330-NoGuide), 300 ng of a pool 
of sgRNA-expressing plasmids (pZB-Sg3), and 100ng ofa plasmid expressing EGFP 
and a puromycin selectable marker from a CAG promoter (pS-pp7-GFPiP). To 
create PX330-NoGuide, we modified PX330 (gift from FE. Zhang, Addgene plasmid 
#44230 (ref. 34)) to remove the sgRNA expression cassette. To generate pZB-Sg3, 
we cloned a human U6 promoter and optimized sgRNA scaffold sequence*? into 
a minimal vector with an ampicillin-selectable marker and a ColE1 replication 
origin. We transfected batches of 250,000 mouse embryonic stem cells using the 
Neon Transfection System (Invitrogen), using one pulse of 40 ms at 1,200 V and 
plated two batches of cells (500,000 total) into a 96-well plate in 200 jl media. 
As an internal control for each set of transfections, we performed a transfection 
using four guides with no predicted target sites in the mouse genome. 

We verified efficient transfection by examining GFP expression after 24h. 
To select for transfected cells, we replaced the media 24h after transfection with 
200 1 2i+ 1ppg ml! puromycin. One day later, we split the cells into a 10-cm plate 
with 8 ml of 0.5,.g ml“! puromycin. One day later, we replaced the media with 
10 ml of 2i with no puromycin. We allowed cells to grow for 7-8 days, replacing 
the media every 2-3 days. We hand-picked 88 individual colonies and 8 control 
colonies for each transfection in 511 media, added 20 ul of TVP for ~10-20 min at 
37°C to dissociate the colonies, and then split the colonies into two identical plates. 
We grew the cells in these plates for 4-5 days. We harvested one of the plates for 
DNA and RNA extraction by removing most of the media and adding 3.5 x volume 
buffer RLT (Qiagen) and froze the other plate for later recovery in Freezing Media 
(2i media + 10% fetal bovine serum + 10% DMSO). 

Genotyping by PCR and sequencing. To genotype each promoter knockout, we 
extracted genomic DNA and performed PCR using primers spanning the deleted 
sequence. We genotyped each clone by running the PCR products on agarose 
gels and comparing PCR amplicon sizes to predicted wild-type and deletion 
band sizes. We confirmed the sequences of wild-type and deletion bands by 
Sanger sequencing or high-throughput sequencing through barcoded amplicon 
sequencing on an Illumina MiSeq (see Supplementary Table 2). Where possible, we 
used known polymorphic sites from 129$1 and castaneus genomes* to determine 
the haplotype-resolved genotype of each clone. Based on the genotyping data, we 
nominated clones for RNA sequencing. We eliminated clones showing evidence 
of polyclonal or subclonal mutations, or complex mutations such as inversion or 
duplication of the genomic sequence between the sgRNAs. The sequences of all 
genotyping primers are listed in Supplementary Table 2. 

RNA sequencing libraries. We generated RNA sequencing libraries as previously 
described***, with some modifications for high sample throughput. We iso- 
lated RNA from harvested mES cells using RNeasy 96 columns. We enriched for 
poly(A)* RNA using oligo d(T)25 magnetic beads (NEB) and eluted in 18,11 H,O. 
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We fragmented RNA to an average of ~150 nt by adding 2,11 Ambion fragmenta- 
tion buffer and incubating at 70°C for exactly 2.5 min. After transferring quickly 
to ice, we added 401] of a master mix containing 1211 5x FNK buffer (50 mM 
Tris-HCl pH 7.5, 5mM MgCh, 0.6 mM CaCl, 50mM KCl, 10mM DTT, 0.01% 
Triton X-100), 1 pl Murine RNase Inhibitor (NEB), 3 ul FastAP Thermosensitive 
Alkaline Phosphatase (Thermo Scientific), 3 11 T4 Polynucleotide Kinase (NEB), 
and 111 TURBO DNase (Life Technologies). We incubated this reaction at 37°C 
for 30 min, then cleaned the reaction with MyOne SILANE magnetic beads” and 
eluted in 6,11 of H2O. 

We proceeded with the library preparation as previously described*, with one 

additional modification. To simplify the library preparation for many samples, we 
added unique sample barcodes (8 nt) during the first adaptor ligation*®. We used 
12 pools each with 4 barcodes in order to mitigate differences in the efficiency 
of ligation for different adaptor sequences. Following the first adaptor ligation, 
we pooled 12 samples together, including up to 9 clones corresponding to a 
single target gene as well as 3 control clones, during the first 70% ethanol wash 
of the SILANE-bead purification. We performed an extra SILANE purification 
using the same beads to remove excess adaptor and then proceeded with reverse 
transcription. 
Hybrid selection of RNA sequencing libraries. To measure allele-specific expres- 
sion for hundreds of genes in a cost-effective manner, we developed a hybrid 
selection strategy to enrich for allele-informative reads at target genes (Extended 
Data Fig. 2). We designed oligo pools to capture allele-informative sequences 
in the ~1,600 RNAs located in the genome within 1 Mb of one of the knockout 
targets. These target RNAs were divided into two independent pools: #140820 
and #141203. We used RefSeq RNA annotations for mRNAs and our custom 
annotations for most IncRNAs. We identified SNPs that would distinguish the 
12981 and castaneus genomes**. We designed 120-bp capture oligos in the vicinity 
of each 129/castaneus polymorphic site, tiling every 15 bp across either 600 bp 
(pool #140820) or 240 bp (pool #141203) centred on the SNP. We included probes 
targeting both alleles to minimize differences in capture efficiency between the 
two alleles. We filtered capture probe sequences as previously described*”. We 
included up to 10 oligos per targeted RNA, duplicating probes where necessary to 
include the sequences corresponding to each allele. Empirically, this probe design 
strategy in combination with the protocol described below enabled assessing allele- 
specific expression for 84% (611 of 731) of the targeted expressed genes in mES 
cells (RPKM > 2) at a sequencing depth of <5 million reads per sample. Target 
genes and oligos sequences for these pools are listed in Supplementary Table 3. 

We synthesized pools of 12,000 capture oligos using CustomArray tech- 
nology. Oligos in each pool were flanked by unique primers (Left primer 
sequence: CTTCCTACGAGCAGTTTGCC; right primer sequence: 
AGTTTACGCATTACGGGCAC). After one round of PCR to add a T7 promoter 
(GGATTCTAATACGACTCACTATAGGG), we generated biotinylated RNA 
probes as described previously**, adding in 20% Biotin-16-UTP (Roche) and 20% 
Biotin-14-CTP (Life Technologies) to the in vitro transcription reactions. We 
generated RNA probes targeting both strands by incorporating the T7 promoter 
into either side of the PCR product and performing two separate in vitro transcrip- 
tion reactions per oligo pool. 

To capture the allele-informative regions, we pooled the final, barcoded RNA 
sequencing libraries from all samples in the batch and performed a modified 
version of solution hybrid selection*’. We first combined 500 ng dsDNA library 
pool with 1 nmol of Illumina P5 and P7 primer mix in 21 j1l total. We denatured 
this mix at 94°C for 10 min and transferred immediately to ice. We added 7.5 1l 
20x SSPE, 0.51 Murine RNase Inhibitor (NEB), and 1 ul of 500 ng! biotiny- 
lated RNA probe, for a total volume of 30,11. We set up at least two reactions per 
10 libraries, including at least one reaction with each strand of probes. We incu- 
bated the hybridization reaction at 65 °C for 24-48 h. For each capture sample, we 
washed 301] Streptavidin Cl MyOne magnetic beads (Invitrogen) in 5x SSPE 
and aliquoted them into PCR tubes. After removing the wash from the beads, we 
added the hybridization reaction and mixed to resuspend the beads. We captured 
the biotinylated probes by shaking at 65 °C for 20 min. We washed the beads twice 
in 150,11 low stringency wash buffer (1 x SSPE, 0.1% SDS, 1% NP-40, 4M urea) at 
62°C for 3-4 min, and twice in 150,11 high stringency wash buffer (0.1 x SSPE, 0.1% 
SDS, 1% NP-40, 4M urea). To elute, we removed the final wash and resuspended 
beads in 1011 100 mM NaOH and heated to 70°C for 10 min. To complete the 
elution, we added 111 1 M acetic acid and 1411 NLS elution buffer (20 mM Tris- 
HCl pH 7.5, 10 mM EDTA, 2% N-lauroylsarcosine, 2.5 mM TCEP) and heated to 
94°C for 4min. While hot, we placed samples on magnet, removed eluate, and then 
placed the eluate on ice for at least 30 s. We cleaned the eluates with 20 1l MyOne 
SILANE magnetic beads as described”, using 7511 RLT and 61 il 100% ethanol for 
the initial precipitation. We eluted in 23 11 H2O, and used this as input for a 50,11 
NEBNext High Fidelity PCR reaction using 500 pmol of the P5 and P7 Illumina 
primers each (98°C for 30; 13 cycles of 98°C for 15s, 68°C for 30s, 72°C for 30s; 
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72°C for 2 min, 4°C hold). We cleaned the PCR reaction twice with 1 x volume 
Agencourt Ampure XP magnetic beads and eluted in 20,11 HO. 

Allele-specific gene expression measurements from RNA sequencing. We 
sequenced RNA libraries on an lumina HiSeq 2500 (Read 1: 38 cycles; Read 2: 
30 cycles; Index: 8 cycles). The first read includes the 8-nt barcode added during 
the first adaptor ligation (see above). Following processing to separate samples 
based on the inline barcodes, we filtered out sequencing reads that aligned to highly 
abundant RNA transcripts, including ribosomal RNAs, snRNAs, and repetitive 
elements, as defined by RefSeq and RepeatMasker. A FASTA file containing these 
sequences is available at the Gene Expression Omnibus (GSE55914). 

We developed a computational pipeline to estimate allele-specific expression 
from RNA-sequencing data. We created two separate reference files for the 129S1 
and castaneus haplotypes, starting with the mm9 genome build and layering on 
SNPs based on whole-genome sequencing of each of the two mouse strains*. 
We aligned RNA-sequencing data separately to each of the two haplotypes using 
TopHat (version 2.0.8). We combined the results of the two alignments using 
PySuspenders”, which identifies reads that map specifically to one or the other 
allele and splits them into separate BAM files. We discarded duplicate reads and 
reads with MAPQ <30. After generating separate BAM files containing the reads 
mapping to each allele, we counted reads that mapped to each RefSeq transcript 
(including both spliced and unspliced isoforms) using Scripture"! and calculated 
‘allelic expression ratios’ for each gene (counts from 129 allele divided by total 
counts from both 129 and castaneus alleles). The distribution of allelic expression 
ratios for all active genes in mES cells was centred on 0.5, indicating that on average 
each gene is expressed equally from the 129 and castaneus alleles (Extended Data 
Fig. 2b). This indicates that there is not systematic bias in our mapping procedure 
towards one allele or the other. 

RNA-seq data analysis. We processed RNA-sequencing data sets in batches 
corresponding to sets of libraries made on the same day with the same hybrid 
selection probe pool. We removed samples with fewer than 100,000 non-repetitive, 
unique, allele-informative reads. For within-batch quality control, we performed 
hierarchical clustering on all samples by their allelic expression ratios and removed 
the 2-5% of outlier samples, which were largely comprised of clones that showed 
monoallelic expression from the X chromosome. 

Assessment of gene knockout by expression analysis. The PCR genotyping 
procedure described above provided putative genotypes for the cell clones. We 
confirmed the genotype of cells by analysing the allele-specific expression of the 
knocked-out gene in each clone. We required that clones show >80% reduction 
of expression of the knocked out gene on the appropriate allele in order to include 
the clone in downstream analysis. Incomplete reduction of expression in some 
cases appeared to result from use of alternative TSSs that were not included in the 
deleted sequence. In other cases, incomplete reduction of expression appeared 
to result from subclonal genetic mosaicism within the cell line, which probably 
resulted from deletions that occurred after several cell divisions, leading to genetic 
differences between individual cells in a colony. For further analysis, we focused on 
gene loci where we obtained at least two heterozygous knockout clones. 
Barplots for allele-specific expression data. Barplots that depict allele-specific 
RNA expression or GRO-seq transcription on modified alleles compared to 
controls (for example, Fig. 1d) were calculated as follows. For each modified allele, 
allele-specific measurements were normalized to the corresponding alleles in wild- 
type clones (for example, values for castaneus knockout alleles were divided by the 
mean of unmodified castaneus alleles in wild-type clones). We performed the same 
calculation for unmodified alleles in wild-type clones to create a null distribution. 
For modified alleles, we further scaled these values by dividing by the mean of the 
wild-type alleles in heterozygous knockout clones. The value of each bar represents 
a mean of these normalized measurements. 

Identifying significant changes in allele-specific expression. In developing a 
statistical approach to identify local, cis effects of these genetic manipulations, 
we sought to distinguish local effects of the genetic deletion from downstream 
effects that result as a consequence of either ncRNA/mRNA functions elsewhere 
in the cell, off-target effects, or biological/technical variation between clonal cell 
lines (Supplementary Note 1). Our power to detect these effects varies between 
different measured genes (owing to their level of expression and availability of 
SNPs) and between different knockout targets (owing to differences in the numbers 
of knockout clones analysed). 

To account for these two variables, we developed a statistical approach to 
empirically estimate the false discovery rate of allele-specific changes in the 
expression neighbouring genes using hundreds of genes on other chromosomes 
as controls. For each gene in the neighbourhood of one of our promoter deletions, 
we calculated three statistics: (i) a t-test statistic comparing the average change 
in expression for each of the knockout alleles (including both heterozygous and 
homozygous knockout clones), normalized to the expression of the gene on the 
wild-type allele of the heterozygous clones; (ii) a z-score statistic comparing the 
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expression of the knockout allele in heterozygous clones to the expression of 
the wild-type allele in the same clone; and (iii) a t-test statistic comparing the 
heterozygotes to the wild-type control clones using the allelic expression ratio 
after applying a variance-stabilizing transformation (arcsin of the square root 
of the allelic expression ratio). For a given gene, only samples with at least 20 
allele-informative reads were considered, in order to enable accurate estimates 
of allele-specific expression. These three tests differ in whether they incorporate 
information from homozygous clones and how they normalize between knock- 
out and wild-type alleles. We required that a gene perform significantly in each 
of the three tests in order to regard the gene as significant, as described below. 
We note that each underlying measure was approximately normally distributed, 
with some apparent outliers across hundreds of control clones; we conservatively 
included these outliers in calculating each test statistic. We examined differences 
in variation between knockout and control alleles with Levene’s test. For estimates 
of the variance of distributions presented in figures, see Supplementary Table 1. 
Because the distributions are only approximately normal, we assessed the 
significance of each of these gene-level statistics by permutation, sampling other 
cell lines from the same experimental batch and randomly assigning them as 
heterozygous or homozygous knockout clones to match the distribution of 
genotypes of the real samples. We calculated an empirical false discovery rate 
for the sum of these permutation ranks, testing each of the neighbouring genes 
and using all of the genes on other chromosomes as the background model. 
Neighbouring genes with FDR <10%, a transformed allelic expression ratio 
>0.03, and an effect size of > 10% in heterozygotes were considered significant. 
No statistical methods were used to predetermine sample size, but we generated 
as many knockout clones as possible. The experiments were not randomized and 
the investigators were not blinded to allocation during experiments and outcome 
assessment. 
Transcriptional read-through for Meg3 and Snhg3. Promoter knockouts of Meg3 
and Snhg3 led to reductions in one or more downstream genes oriented in the 
same direction as the knockout target gene. We attributed these changes to tran- 
scriptional read-through based on the following evidence (Supplementary Note 4 
and Extended Data Fig. 3). For both Meg3 and Snhg3, we observed evidence for 
transcription continuing past the annotated 3’ end of the knockout target, through 
intergenic regions, and into the downstream gene (as assayed by RNA sequencing 
of chromatin-associated RNA). For the Meg3 locus, we did not observe H3K4me3 
or CAGE reads at the 5’ ends of Rian and Mirg (downstream of Meg3), indicating 
that they are not expressed from their own promoters. In the Snhg3 locus, the 
downstream affected gene (Rec1) is in fact expressed from its own promoter, but 
we found evidence for reads splicing from just downstream of Snhg3 into the first 
splice acceptor of Rec1, indicating that at least some fraction of Rccl transcripts 
begin at the Snhg3 promoter. 
Insertion of polyadenylation signals. To halt transcription, we initially attempted 
to use a short 49-bp synthetic polyadenylation signal (spA) sequence’ to 
minimize the amount of genomic sequence added (Extended Data Fig. 6b). For 
a given gene, we designed a guide 0.5-3 kb downstream of the transcription start 
site. We designed 200-nt ssDNA oligos including the spA sequence flanked by 
75- and 76-bp homologous arms, centred on the sgRNA cut site (~4 bp upstream 
of the PAM sequence), and ordered these as ultramers from Integrated DNA 
Technologies (Supplementary Table 2). To knock in polyadenylation signals, 
we transfected 100 ng PX330-NoGuide, 100 ng pZB, 100 ng pS-pp7-GFPiP, and 
100-200 ng of donor ssDNA oligo and followed the selection procedure described 
for the promoter knockouts. To genotype these insertions, we used a combina- 
tion of PCR and high-throughput amplicon sequencing as described above. We 
identified clones that had heterozygous insertions of the full 49-bp spA sequence 
on one allele; we typically observed that the other allele had a short insertion or 
deletion, consistent with non-homologous end joining (NHEJ)-mediated repair. 
This short pAS sequence (spA) succeeded in halting the transcription of three 
RNAs: Blustr (pAS at +40 bp and +0.5 kb in Fig. 3), Gpr19, and Bendr. However, 
for other genes, transcription was unaffected despite pAS knock-in, consistent with 
the location-dependent efficiency previously observed for this pAS sequence”. 
Accordingly, we built a larger construct containing three polyadenylation signals 
(p3PA, Extended Data Fig. 6c). The structure of this construct upon insertion into 
the genome through homologous recombination is as follows: spA-EFS promoter- 
Puromycin resistance gene IRES thymidine kinase-WPRE-SV40 pAS-PGK pAS 
(p3PA-Puro-iTk). We co-transfected 300 ng of this construct with 100 ng of pZB 
and 100 ng of PX330-NoGuide, waited three days, and then selected for cells with 
integrations with 11g ml! puromycin for one week. We picked individual colonies 
and used PCR to genotype clones, using primers spanning the insertion junctions. 
We sequenced these PCR products to determine the allele of insertion. Following 
genotyping, we expanded clonal cell lines and transfected them with PX330 anda 
pool of four sgRNAs to delete the selection cassette, leaving behind three tandem 
pASs. Following selection with 2,.g ml“! ganciclovir, we again picked individual 


colonies, used PCR to confirm loss of the cassette, and sequenced RNA from 
multiple clones. PCR primer sequences for cloning homology arms and genotyping 
p3PA insertions are listed in Supplementary Table 2. 

Knockouts of Blustr exons and introns. To delete each exon and intron of Blustr, 
we transfected cells with pools of guides as described for the promoter deletions, 
using two guides on each side. We assessed the genotype of clonal cell lines as 
described above for promoter deletions. To confirm exon knockout from RNA 
sequencing data, we examined SNPs in each of the exons. Upon knockout of exon 
2, for example, we observed loss of RNA sequencing reads mapping to exon 2, while 
reads mapping to other exons were still present. We also identified reads spanning 
a new splice junction between exon 1 and exon 3, further confirming that exon 2 
was removed from the mature transcript. For bar plots in Fig. 3 measuring Blustr 
expression, the values represent the normalized read counts of the remaining exons 
that were not deleted in that experiment. To confirm intron knockout, we used 
PCR primers spanning the deletion junction and sequenced the resulting PCR 
products. We note that the intron knockouts, by design, do not affect the sequence 
of the spliced Blustr RNA. 

5’ splice site knockout. To knock out the 5’ splice site of Blustr, we co-transfected 
mES cells as described above, using a single sgRNA pZB plasmid and 200 ng of 
ssDNA oligonucleotide donor for homologous recombination (Extended Data 
Fig. 8c). The oligo was ordered as an ultramer from Integrated DNA Technologies 
(Supplementary Table 2). We genotyped these insertions through amplicon 
sequencing using an Illumina MiSeq (primers in Supplementary Table 2). 
Transcriptional activity with GRO-Seq. We used precision run-on sequencing 
(PRO-seq)**, a variant of global run-on sequencing", to map transcriptionally 
engaged RNA polymerase for a subset of clones. Clones for PRO-seq (as well as 
ChIP-seq and assays for transposase-accessible chromatin with high-through- 
put sequencing (ATAC-seq)) were chosen from among the recoverable knockout 
cell lines with a preference for clones with homozygous knockouts or knockouts 
on the 129 allele only. We performed PRO-seq as previously described, with 
modifications. We harvested 10 million mES cells by scraping, washing in cold 
PBS, and spinning at 330g for 3 min. The cell pellet was resuspended in 1 ml cold 
douncing buffer (10 mM Tris-HCl pH 7.4, 300mM sucrose, 3mM CaCl, 2mM 
MgCh, 0.1% (v/v) Triton X-100, and 0.5mM DTT) per 1 million cells. The cells 
were incubated on ice in the cold room for 5 min and dounced 25 times. The nuclei 
were pelleted at 500g for 2 min, washed twice in 5 ml douncing buffer, and centri- 
fuged at 500g for 2 min. The nuclei were then gently resuspended in 10011 of cold 
storage buffer (10 mM Tris-HCl, pH 8.0, 25% (v/v) glycerol, 5mM MgAc», 0.1 mM 
EDTA, and 0.5mM DTT), immediately flash frozen, and stored at —80°C until use. 

A 2811 2 Nuclear Run-On (NRO) mix was prepared as follows: 1 M Tris-HCl, 
pH 8.0, 1M MgCh, 2M KCL, and 0.1 M DTT. 5,11 of 1 mM Biotin-11-CTP (Perkin 
Elmer), 1 pl of 0.05 mM CTP, 2.5411 of 2mM ATP, 2.51 of 2mM GTP, 2.511 of 
2mM UTP (Sigma Aldrich), 6.5 11 of nuclease free water, and 2 \1l of SUPERaseIn 
(Ambion) were added to the 2x NRO mix and mixed well before the addition of 
50 tl of 2% NLS. The NRO reaction mix was mixed well and preheated to 37°C. 
10011 of NRO mix was added to 100 1] of nuclei in storage buffer. The reaction 
was mixed gently by pipetting and incubated at 37°C for 3 min, mixing halfway 
through. To halt the reaction 50011 of TRIzol LS (Thermo Fisher) was added, 
mixed well, and incubated at room temperature for 5 min. RNA was isolated 
through a chloroform extraction and ethanol precipitation, and resuspended in 
2011 of H2O. The RNA was heat denatured at 65°C for 40s and fragmented on ice 
for 10 min with 5 11 of IN NaOH. To stop the reaction, 511 of 1 M acetic acid and 
2011 of 1 M Tris-HCl, pH 7.4 were added. To remove unincorporated biotinylated 
nucleotides, the sample was passed through a P-30 exchange column (BioRad). 
1 ul of RNase inhibitor was added to the ~50 11 of RNA and the first biotin enrich- 
ment was then performed. 

Each biotin enrichment was performed as follows. To prepare the Streptavidin 
M280 Beads (Invitrogen) for biotin enrichment, 100 1l of beads were taken per 
sample and washed once in 0.1N NaOH with 50 mM NaCl and twice in 100 mM 
NaCl. Beads were resuspended in 1601] of binding buffer (10 mM Tris-HCl, pH 
7.4, 300mM NaCl, and 0.1% (v/v) Triton X-100). To each sample an equal volume 
of Streptavidin M280 beads was added, mixed, and incubated on a rotator for 
20 min at room temperature. The beads were magnetically separated and washed 
twice in 50011 of ice cold high salt wash buffer (50 mM Tris-HCl, pH 7.4, 2M 
NaCl, and 0.5% (v/v) Triton X-100), twice in 500 il of binding buffer, and once 
in 5001 of low salt wash buffer (50 mM Tris-HCl, pH 7.4 and 0.1% (v/v) Triton 
X-100). To harvest the RNA, 300 11 of TRIzol (Thermo Fisher) was added to the 
beads, vortexed for 20s, and incubated at room temperature for 3 min. 6011 of 
chloroform was added and mixture was incubated at room temperature for 3 min. 
The samples were centrifuged at 14,000g for 5 min at 4°C. The aqueous phase 
was collected and transferred to a new tube; the remaining organic phase was 
removed from the beads. The TRIzol extraction was then repeated as above and 
the two aqueous phases were combined. RNA was purified with a chloroform 
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extraction and ethanol precipitation, and resuspended in nuclease-free water. RNA 
sequencing libraries were then prepared as described above, except that SILANE 
clean-ups were replaced with streptavidin-biotin capture enrichments until after 
reverse transcription (a total of three enrichments). 

We sequenced PRO-seq libraries to a depth of ~10 million 30-bp paired-end 
reads. To analyse the data, we mapped and processed the RNA sequencing data as 
described above, including aligning individually to the 129 and castaneus genomes. 
Figures showing ‘Allele-specific GRO-seq depict coverage for reads that uniquely 
map to the specific allele indicated in the figure. To assess the relative read density 
in the promoter-proximal region and gene body of Sfmbt2, we counted reads in 
the 2kb region downstream of the first Sfmbt2 TSS and in the remainder of the 
gene body*®. We calculated the pause index as the ratio of these two quantities, 
normalized to total read count. We noticed that different PRO-seq libraries had 
subtle biases in the relative fraction of reads aligning to the TSS versus the gene 
body, leading to slightly offset distributions of pause indices across all genes, and 
so we corrected for these biases in each library by normalizing TSS and gene body 
RPKMs to the median of the ~5,000 genes with coverage across all samples. 
Chromatin accessibility with ATAC-seq. Libraries were generated as previously 
described*” using 50,000 mES cells. We generated duplicate ATAC-seq libraries for 
each clonal cell line examined and sequenced each to a depth of ~40 million 30-bp 
paired end reads. We aligned paired-end DNA sequencing reads using Bowtie2 
(ref. 48) to each of the 129 and castaneus genomes with the following parameters: 
—met-stderr-maxins 1000, removed duplicate reads using Picard (http://picard. 
sourceforge.net), and filtered to uniquely aligning reads using samtools (MAPQ <30, 
https://github.com/samtools/samtools). For plotting normalized read coverage 
at the Blustr and Sfmbt2 promoters, we combined data from the two biological 
replicates (two independent measures of the same cell line) and connected paired- 
end reads to generate fragments. Fragment coverage was normalized by the total 
number of uniquely mapping reads. 

Chromatin immunoprecipitation. ChIP-seq for H3K4me3 and H3K27me3 was 
performed using monoclonal antibodies as previously described””. Sequencing data 
was analysed as for ATAC-seq described above. 

Validation of allele-specific RNA expression with ddPCR. To validate our 
RNA-seq based measurements of allele-specific expression, we used a quanti- 
tative allele-specific PCR assay to verify measurements for Blustr and Sfmbt2. 
We isolated RNA from harvested mES cells using RNeasy 96 columns and per- 
formed a DNase treatment followed by reverse transcription of 500 ng of RNA 
(total reaction volume 20,11). We performed droplet digital PCR (ddPCR) 
using Bio-Rad Custom ddPCR Assays that involve qPCR primers flanking 
a polymorphic site and two allele-specific fluorescent probes. For Blustr: left 
primer sequence: GACAAATACTCCCTTCAACA,; right primer sequence: 
GAACAGTTTGTCCTGCC; probe sequence: TAAGTGAGGTGAACTCCAAG 
(129 allele, FAM) or AGTGAGGCGAACTTCAAG (castaneus, HEX). For Sfmbt2: 
left primer sequence: TGTAAGTTTGCCTGATACTC; right primer sequence: 
TCTAATGTACCTCAGCCC; probe sequence: TTTCCTATGAGCAGTTCAAC 
(129 allele, FAM) or TCCTATGAACCGTTCAGC (castaneus, HEX). ddPCR was 
done with 2.211 of cDNA, 11 jul of Supermix (BioRad), 1.1 jl of each probe, and 
7.7 1 of water per reaction followed by droplet generation. PCR was performed as 
follows: 95°C for 10 min; cycling at 94°C for 30s and 55°C for 1 min for a total of 
40 cycles; and 98°C for 10 min. Readout was done using the QX200 Droplet Reader 
and Quantasoft Software (BioRad) to determine the total number of droplets 
containing each allele. We calculated allelic expression ratios from these values and 
compared it to values generated through RNA-sequencing and hybrid selection of 
the same RNA samples (Extended Data Fig. 2d, e). 

External ChIP-seq, RNA-seq, and DNase HS data. We used the following data 
from ENCODE”: H3K4me3, H3K4mel, H3K27ac, and CTCF ChIP-seq in 
mES cells (ES-Bruce4); DNase hypersensitivity sequencing in mES cells (E14); 
H3K4me3, H3K4mel, and CTCF ChIP-seq and DNase HS data in H1-hES cells; 
and RNA-sequencing data in H1-hES cells (nuclear poly(A)‘, nuclear total). To 
assess transcription factor binding to mRNA and IncRNA promoters (Extended 
Data Fig. 7c), we examined mES cell ChIP-seq peaks available from Kagey et al. 
at the Gene Expression Omnibus (GSE22562)*!. 

DNA purification for examining proximity contacts. To examine the proximity 
contacts of the linc1405 locus, we used the RAP-DNA protocol, which we initially 
developed in order to map RNA localization to chromatin, to capture linc1405 
DNA”. In brief, we cross-linked live cells to fix endogenous chromatin complexes, 
then purified a target DNA region using a pool of oligonucleotides targeting the 
linc1405 locus (Supplementary Table 3). Here, we used probes that are the same 
strand as the linc1405 RNA—in this way, we specifically capture the linc 1405 
DNA and do not directly capture the linc1405 RNA itself. We mapped the 3D 
proximity contacts of the linc1405 locus through high-throughput sequencing of 
co-purified DNA and calculated the normalized enrichment to an input DNA 
library in 1-kb windows (Extended Data Fig. 7e). Annotations for topologically 


LETTER 


associated domains (TADs) were downloaded from the Ren Laboratory 
(http://chromosome.sdsc.edu/mouse/hi-c/download.html)*. 

LncRNA transcript annotations. For evolutionary conservation analysis, 
we used IncRNA annotations and isoforms previously defined based on RNA 
sequencing in mouse embryonic stem cells, combining annotations generated with 
multiple methods (Scripture*! and slncky”*). We filtered the combined list using 
slncky’s to eliminate transcripts predicted to encode proteins or micropeptides 
by UCSC, transcripts that partially align to protein-coding genes (for example, 
pseudogenes or incomplete reconstructions), and species-specific coding gene 
duplications. Subsequently we performed several manual curation steps. We 
examined each isoform using a combination of long-read RNA-sequencing data, 
total chromatin-associated RNA sequencing data, cap analysis of gene expres- 
sion (CAGE) data, and poly(A)* 3/-end sequencing data from mES cells78704193, 
We eliminated transcripts that appeared to result from an extended 3’ UTR of 
an upstream protein-coding transcript. Because the precise 5’ ends of transcripts 
are imprecisely assigned based on RNA-sequencing data alone, we re-assigned 
5! ends (TSSs) using a sliding-window approach to find the 10-bp window with the 
highest number of same-strand CAGE reads within 300-bp of the initial calculated 
TSS. We additionally manually curated the TSS of each IncRNA, some of which 
were incorrectly assigned by more than 300 bp, based on CAGE and H3K4me3 
ChIP-seq data, and eliminated any where we could not identify the TSS (for 
example, due to an unmappable sequence or very low abundance). 

Analysis of IncRNA and promoter conservation. To categorize IncRNAs by 
their conservation properties and promoter locations, we examined a set of 307 
IncRNAs expressed in mES cells as described above. We assessed the conservation 
of each IncRNA through a two-step approach. We first used slncky to look in 
syntenic locations for evidence of IncRNA transcripts in deep poly(A)t RNA- 
seq of rat, chimp, and human induced pluripotent stem cells (iPSCs)**. LacRNAs 
called ‘conserved’ by this first filter have substantial evidence based on RNA-seq 
that allows for independent reconstruction of the transcript in one or more of 
these other organisms. We categorized the remaining IncRNAs by the location of 
their TSS: 71 IncRNAs originate within 500-bp of an mRNA TSS on the opposite 
strand (divergent); 59 IncRNAs originate within the long-terminal repeats (LTRs) 
of endogenous retroelements; and 79 IncRNAs have their promoters in intergenic 
regions that do not overlap with LTRs and do not emerge from a bidirectional 
mRNA promoter (henceforth, ‘intergenic). 

Because some conserved IncRNAs might be expressed at too low a level to 
assemble a transcript de novo in a given species, we examined more closely the 79 
intergenic IncRNAs that were called ‘mouse-specific in the initial slncky analysis. 
We applied a second, more stringent threshold to remove IncRNAs misclassified 
as mouse-specific due to low abundance. For each intergenic IncRNA locus, we 
used liftOver*! to map the 10 bp surrounding the mouse TSS (mm49) to the human 
genome (hg19) (minMatch=0.1, UCSC chain). 37 of these transcripts did not lift 
over at this step, and thus were considered mouse-specific. For the 42 that did lift 
over, we examined the syntenic region for evidence of poly(A)+ RNA-seq data from 
human iPSCs”* or poly(A)* nuclear-fraction RNA-seq from hES cells (-100 to 
+900 bp relative to the TSS), or for evidence of poly(A)* nuclear-fraction or whole- 
cell CAGE from hES cells (-250 to +250 bp relative to the TSS), and removed from 
consideration any IncRNAs that showed evidence for RNA-seq or CAGE above 
a certain threshold. We chose this threshold based on a set of random intergenic 
regions, which were matched to the set of intergenic mouse-specific IncRNAs based 
on GC content. We eliminated from consideration the ten IncRNAs that showed 
RNA-seq or CAGE signals greater than the 90th percentile of random regions, 
corresponding to approximately two CAGE or RNA-seq reads in the windows 
described above. These ten IncRNAs were added to the ‘conserved’ section of 
the pie chart in Fig. 4a. Several of these ten IncRNAs correspond to substantially 
shortened, single-exon poly(A)* transcripts that show minimal overlap with the 
syntenic exons in mouse; although a majority of the exonic sequence of these 
transcripts are not in fact conserved between human and mouse, we excluded these 
from consideration as putative mouse-specific IncRNAs. 

For the purposes of examining the conservation properties of these intergenic 
mouse-specific IncRNAs, we defined a matched set of ‘enhancer’ elements. We first 
generated a list of regulatory elements in mES cells using the DNase hotspots called 
by ENCODE-UW in ES-E14 cells. As an estimate of the activity of each element, we 
calculated the density of H3K27ac reads in the region. From the set of intergenic 
elements that did not overlap a promoter, IncRNA promoter, or LTR, we selected 
a random subset matched to the intergenic IncRNA promoters for H3K27ac 
density (binned by 10 reads per bp) and distance to the TSS of the closest active 
gene (binned by 5kb). We call these elements ‘enhancers’ because they are marked 
by DNase hypersensitivity and H3K27ac but do not overlap a known gene 
promoter. 

We compared the sequence conservation and functional conservation of three 
classes of elements: intergenic mouse-specific IncRNAs, matched intergenic 
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enhancer elements, and GC-matched random intergenic elements. First, we 
computed the rate at which each set maps to human sequence. We centred each 
element and used liftOver (-minMatch = 0.1) to identify the syntenic region in 
the human genome. Elements that did not lift over at this step correspond to the 
white segment of the pie charts in Fig. 4 (iii, did not map). For elements that did lift 
over to human, we next defined the subset that map to putative regulatory elements 
in human. We examined a 500-bp window centred on the lifted over region and 
counted reads in hES cell DNase-seq data from ENCODE. We defined regions 
showing DNase HS scores higher than 95% of the mappable random intergenic 
regions as putative DNA regulatory elements. We note that these random inter- 
genic regions include some enhancers; they are matched to IncRNA promoters 
for GC content, and thus frequently correspond to regulatory elements (which 
are GC-rich) that happen to be active in hES cells. For both intergenic mouse- 
specific IncRNAs and enhancers, ~33% of elements corresponded to putative 
DNA regulatory elements in human (Fig. 4d), representing a ~6.6-fold enrichment 
versus the random intergenic controls. To compare sequence conservation of 
these classes of elements, we calculated the average SiPhy score” across each 
500-bp region surrounding the mouse TSS or the centre of the enhancer 
element, using the 29 mammals alignment from the mouse perspective”®. 
We used a two-sided Mann-Whitney U-test to look for changes in the distributions 
of SiPhy scores to the set of mappable random intergenic regions (Fig. 4d: 
random ii + iii). 

Impact of expression level on conservation analysis. Although the set of inter- 
genic mES cell IncRNAs examined above does not show any significant evidence 
for poly(A)* RNA in the syntenic locus in human, some of these transcripts may 
not be detected in human and yet still be truly conserved. These transcripts might 
be misclassified as mouse-specific IncRNAs for several reasons, including: (i) low 
expression level in hES cells and iPS cells such that the IncRNA, by chance, is not 
detected based on the depth of sequencing data available; or (ii) the ncRNA is not 
expressed in hES cells or iPS cells, but is expressed in a different human cell type 
and thus may have a conserved function. 

To estimate the false positives resulting from these and other scenarios, we 
examined the properties of a set of 853 conserved mRNAs matched to the inter- 
genic mouse-specific IncRNAs based on expression in mES cells. We counted 
the frequency at which these mRNAs would be called ‘not conserved’ by the 
same procedures described above: we applied the nuclear poly(A) CAGE and 
RNA-seq filters to eliminate transcripts that show detectable transcription in the 
1-kb region near the TSS. While 87% of the intergenic IncRNAs described above 
passed these filters (and thus appeared to be mouse-specific), only 22% of the 
expression-matched mRNAs passed; this indicates that the set of 69 mouse-specific 
intergenic IncRNAs are approximately 3.9-fold enriched for human elements that 
are not transcribed in hES cells. Thus, the mouse-specific IncRNAs defined above 
appear to consist largely of transcripts that are not conserved. 

We performed the following additional analyses to ensure the robustness of our 
conclusions regarding the existence of IncRNAs that evolved from ancestral regula- 
tory elements. First, we examined the conservation of the first 5’ splice sites of this 
set of IncRNAs. In 7 of these 11 loci, the GT dinucleotide in the first 5’ splice site 
is not conserved, suggesting that a similar spliced transcript cannot be produced 
from this locus. Second, we re-performed the entire conservation analysis focusing 
on the 50% of mES cell intergenic IncRNAs with the highest expression levels; 
these IncRNAs are less likely to be missed in hES cells due to low abundance. We 
also adjusted our poly(A)+ RNA and CAGE filters to require a complete absence 
of reads in the corresponding regions in hES cells and iPS cells. Using these 
filters, 79% of the intergenic IncRNAs are not detectably expressed in human 
cells, representing a ~12-fold enrichment over mRNAs matched for expres- 
sion level. Therefore we are confident that most of these IncRNAs are correctly 
classified as mouse-specific. Of the 30 intergenic IncRNAs called mouse-specific 
by this more conservative analysis, 5 do indeed correspond to putative DNA 
regulatory elements, including linc1494 (Fig. 4c), representing a >8-fold enrich- 
ment versus GC-matched random sequences (chi-squared P< 1 x 10~1°). Thus, 
our conclusions that some IncRNAs appear to evolve from ancestral regulatory 
elements are robust even with stringent thresholds. 

Software for data analysis and graphical plots. We used the following software 
for data analysis and graphical plots: R Bioconductor (version 3.0)°”, Gviz (version 
1.10.11), gplots (version 2.17.0), GenomicRanges (version 1.18.4)°8, rtracklayer 
(version 1.26.3)*°, BEDTools™, Integrative Genomics Viewer (version 2.3.26)", 
and vcftools (version 0.1.12). 

Data availability. Sequencing data for this study is available at the Gene Expression 
Omnibus (GSE80262 and GSE85798), and additional visualizations of the data are 
available at http://pubs.broadinstitute.org/neighboring-genes/. 


Code availability. Code for the analyses described in this paper is available from 
the authors upon request. 
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Extended Data Figure 2 | Generation of knockout clones and 
measurement of allele-specific RNA expression. a, Overview of 
knockout and measurement protocol. b, Distribution of allelic expression 
ratios (number of informative reads mapping to 129S1 allele divided by the 


number mapping to either the 129 


S1 or the castaneus allele) across active 


genes in mES cells. c, Scatter plot of allelic expression ratios for genes 
with RPKM > 2 that have more than 100 allele-informative reads across 
all libraries. Allelic expression ratios are consistent in RNA sequencing 
data before and after hybrid selection (HS). d, e, Allelic expression ratios 
as measured by two independent methods for Blustr (d) and Sfmbt2 (e) 
expression in 15 clonal cell lines containing genetic modifications in 
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the Blustr locus. Each dot represents the mean of two ddPCR technical 
replicates (x axis) and the value from one RNA-seq technical replicate 

(y axis). f, Example locus showing hybrid selection strategy and RNA-seq 
coverage for cell lines with the indicated genotype for deletion of the Bendr 
promoter. The y axis scales represent normalized read counts and are the 
same for all hybrid selection tracks. The absolute level of expression for 
any given gene varies among clonal cell lines; throughout this work, we 
instead consider the relative level of expression between the two alleles in 
heterozygous knockout cells. For similar plots of each gene studied, 

see http://pubs.broadinstitute.org/neighboring-genes/. 
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Extended Data Figure 3 | Read-through transcription at Meg3 and 
Snhg3 loci. a, Snhg3 promoter knockout reduces the levels of Rccl mRNA 
by 23%. However, sequencing of chromatin-associated RNA shows 

that transcription continues past the annotated 3’ end of Snhg3 into the 
downstream Rccl gene (see Methods). This read-through transcription 
creates a fusion transcript containing exons of both Snhg3 and Rccl, 

as well as intergenic RNA. We note that this fusion transcript is also 
annotated in the syntenic human locus as an alternative isoform of RCCI1. 
Bars, relative poly(A)* RNA expression on modified versus unmodified 


= 


alleles. Error bars, 95% confidence interval for the mean (n > 2 alleles, 
see Supplementary Table 1). b, Meg3 promoter knockout eliminates the 
expression not only of Meg3 but also of two additional IncRNAs encoded 
downstream in a tandem orientation (Rian and Mirg). Although these 
three IncRNAs are annotated as separate genes, they appear to be derived 
from a single transcript driven by the Meg3 promoter. This is consistent 
with the presence of continuous chromatin-associated RNA throughout 
the locus and a lack of CAGE reads at the 5/ ends of Rian and Mirg3. 
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Black, knocked-out IncRNA; blue, gene with significant allele-specific 
change in gene expression (FDR < 10%). Independent clones are not 
expected to yield the same significance value (z-score), in part because 
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Extended Data Figure 6 | Dissecting mechanisms for how gene loci 
regulate a neighbour. a, Three categories of possible mechanisms by 
which a gene locus might regulate the expression of a neighbour. b, We 
used two strategies to insert pAS downstream of gene promoters. In the 
first strategy, we inserted a 49-bp synthetic pAS (spA) using a single- 
stranded DNA oligo with 75-bp homology arms (see Methods). c, In the 
second pAS insertion strategy, we cloned a donor plasmid containing 

a selection cassette and three different pAS sequences (see Methods). 
Homology arms of 300-800 bp were used to integrate the cassette. After 
isolating clones with successful insertions, we used a second round of 
transfections to remove the selection cassette, leaving behind three tandem 
pASs. EFS, elongation factor 1 promoter; Puro, puromycin resistance gene 
(pac); HSV-tk, herpes simplex virus thymidine kinase. 
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Extended Data Figure 7 | Promoters of IncRNAs and mRNAs have 
enhancer-like functions. a, Allele-specific GRO-seq signal for clones with 
the indicated modifications at the Bendr locus. Only reads specifically 
mapping to one of the two alleles are shown. The y axis scale represents 
normalized read count and is the same for all tracks. b, Allele-specific 
poly(A)* RNA expression for genetic modifications at the linc1405, 
Snhg17, Gpr19, and Slc30a9 loci. Bars, average RNA expression on 
modified compared to unmodified (wild-type) alleles. Error bars, 

95% confidence intervals for the mean (n > 2 alleles, see Supplementary 
Table 1). Grey arrows indicate distance from the targeted locus promoter 
to the affected neighbouring gene. We note that, based on their location, 
the Snhg17 and Gpr19 pAS insertions probably allow more substantial 
splicing and transcription; for these loci, it is clear that the majority of 
the transcript is dispensable but it is possible that transcription close to 
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the promoter may be involved in the cis-regulatory function. c, Presence 
(grey) or absence (white) of various chromatin marks and transcription 
factors in mES cells in a 1.5-kb window centred on the TSS of each 
targeted gene. d, Distance from each knocked-out gene to its neighbouring 
target gene (x axis) versus the magnitude of the effect on the expression 
of the neighbouring gene (per cent compared to wild-type, y axis). Blue 
genes represent those discussed in main text; grey genes are discussed in 
Supplementary Note 5. e, Proximity-based contacts between the linc1405 
and Eomes loci. The y axis shows enrichment in a sequencing-based 
proximity assay in which we used antisense oligos to capture linc1405 
DNA and any interacting, cross-linked proximal DNA (see Methods). 
TAD annotations are derived from Hi-C experiments in mES cells 
(see Methods). Blue arrow, focal contact between the linc1405 and 
Eomes loci. 
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Extended Data Figure 8 | Characterization of genetic modifications 

in the Blustr locus. a, Allele-specific GRO-seq signal for clones with 

the indicated modifications at the Blustr locus. Only reads specifically 
mapping to one of the two alleles are shown. The y axis scale represents 
normalized read count and is the same for all tracks, and is magnified 
five times at the indicated location to better visualize the reads in the 
Sfmbt2 locus. b, Quantification of allele-specific GRO-seq signal in the 
Sfmbt2 locus on alleles modified as indicated. TSS, region including the 
two alternative TSSs of Sfmbt2 and 2 kb downstream; gene body, region 
containing the remainder of the Sfmbt2 gene locus; pause index, ratio of 
TSS to gene body. Dashed grey lines indicate the 95% confidence intervals 
for the mean of eight wild-type clones. Bars, n = 8 for wild-type and n= 1 
for others. c, Schematic of the 5’ end of the Blustr locus and genotypes 

of two knockout clones. The 5’ splice site is located 78 bp downstream of 
the Blustr transcription start site (in this panel, Blustr is transcribed from 
left to right). One of the alleles from the two clones contains insertion of 
the oligo mediated by homologous recombination; the remaining three 
alleles contain insertions or deletions resulting from non-homologous 
end joining repair of ssRNA-mediated double-strand breaks, some of 


— 5’ss deletion (—/-) 


which also disrupt the 5’ splice site. Bar plots show allele-specific RNA 
expression for knockout clones and control clones (n = 18 for +/+, 1 for 
others). Error bars, 95% confidence interval for the mean. d, Schematic of 
the observed splice structures of Blustr RNA transcripts in poly(A)* RNA 
sequencing of the exon deletion clones. Each deletion removes a region 
including ~50-200 bp on either side of the exon, thereby removing both 
the exon and its splice sites. The Exon 4 deletion removes the endogenous 
pAS, leading to new isoforms of the IncRNA transcript that splice into two 
cryptic splice acceptors downstream. e, GRO-seq, H3K4me3 ChIP-seq, 
and chromatin accessibility (ATAC-seq FPKM) at the Blustr and Sfmbt2 
promoters in cell lines with the indicated genotypes. Deletion of the first 
5! splice site leads to a significant reduction in H3K4me3, RNA polymerase 
occupancy, and chromatin accessibility at the Blustr promoter, as well as 
H3K4me3 and RNA polymerase occupancy (but not accessibility) at the 
Sfmbt2 promoter. f, H3K27me3 ChIP-seq at the Blustr and Sfmbt2 loci in 
cell lines with the indicated genotypes. Deletion of the Blustr promoter or 
5! splice site leads to spreading of the repression-associated H3K27me3 
modification across a ~30kb region. 
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Extended Data Figure 10 | Classification of IncRNAs based on 
conservation and promoter location. a, Classification of 307 IncRNAs 
expressed in mES cells. ‘Conserved’ transcripts are those that show 
significant evidence of cap analysis of gene expression (CAGE) data 
and/or poly(A)* RNA in syntenic loci (see Methods). Divergent, initiating 
within 500 bp of an mRNA TSS, on the opposite strand; ERV, endogenous 
retroviral repetitive element (see Supplementary Note 9). Box plot shows 
sequence-level conservation of the promoters of subsets of IncRNAs 
expressed in mES cells. Random intergenic regions are matched to IncRNA 
promoters by GC content. Positive SiPhy score indicates evolutionary 
constraint on functional sequences. Orange category corresponds to 
mouse-specific IncRNAs that appear to have evolved from ancestral 
regulatory elements (REs) and correspond to sequences that show 
evidence for DNase I hypersensitivity in human embryonic stem cells. 
Significance is calculated compared to random intergenic regions using 


transcription + RNA processing 
can enhance cis function 
a Mann-Whitney U-test. ***P < 0.001. Box represents first and third 
quartiles; centre line represents median; whiskers represent data within 
1.5x the interquartile range. b, Chromatin and RNA data for 11 mouse- 
specific IncRNAs that appear to have evolved from ancestral regulatory 
elements. In mouse, these elements show evidence for CAGE, H3K4me3, 
and DNase I hypersensitivity, consistent with their roles as promoters. 
The syntenic sequences in human do not show evidence for CAGE but 
nonetheless are DNase I hypersensitive and are frequently marked by 
H3K4mel and/or CTCE. c, Model for evolution of IncRNAs from pre- 
existing enhancers, which often initiate weak bidirectional transcription. 
Spliced transcripts may neutrally appear through the appearance of splice 
signals and loss of polyadenylation signals. In some cases, transcription, 
splicing, or other RNA processing mechanisms may feed back and 
contribute to the cis-regulatory function of the promoter, producing a 
IncRNA as a by-product. 
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CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/nature19808 


Corrigendum: An early 
geodynamo driven by exsolution 
of mantle components from 
Earth’s core 


James Badro, Julien Siebert & Francis Nimmo 


Nature 536, 326-328 (2016); doi:10.1038/nature18594 


Some of the analyses (samples X1-2, X1-3, X1-4 and X2-4) reported 
in this Letter’s Extended Data Table 1 and in the Supplementary Data 
spreadsheet were previously published in refs 28 and 30. The other 
analyses are new measurements, as are all measurements of aluminium 
concentration in the metal we studied. Owing to a typographical error, 
an erroneous value for the magnesium composition of sample X2-4 
was reported in refs 28 and 30 (which has no impact on the conclusions 
of those papers because magnesium concentration was not used). Our 
recent re-evaluation of the raw microprobe data confirms that we have 
here reported the correct value (0.94 mol% in Extended Data Table 1, 
corresponding to 0.59 wt% in the Supplementary Data spreadsheet) for 
the magnesium composition of sample X2-4. (Therefore, no correction 
to this Letter is required.) 
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The gamble of a PhD hiatus 


Pressure to barrel through a PhD can make taking a break seem daunting — but there are 


ways to limit career damage. 
BY ERYN BROWN 


PhD students to take time out from the 
pursuit of knowledge and the lab. 

It can be a tough call for students to make. 
Immersed in their work with no assurances of 
a great job, driven to scour the literature to stay 
current and primed to worry about competi- 
tion and impressing their advisers, many PhD 
students think that academic success is every- 
thing. For them, nothing comes before their 
studies and research programme. 

Breaks are risky — there is no way to ensure 


| ife can get in the way of science, forcing 


a smooth return to studies, funding and the 
bench. University policies governing gap time 
vary across nations, regions and institutions, 
and maintaining funding and research conti- 
nuity can pose hurdles. Attitudes towards time 
off also differ widely. Many faculty members 
and potential future employers look askance 
at a doctoral student’s decision to step aside, 
even for a brief period. 

Your capacity to put your PhD programme 
on hold will depend largely on your field, 
your institution and your advisers. In gen- 
eral, you can take a break when you need it, as 
long as you are prepared for the consequences 


17 NOVEMBER 2016 


— particularly ifyou aim to pursue an academic 
career. The decision could affect your reputa- 
tion, publishing record and ability to stay cur- 
rent with your research programme. But with 
careful planning, there are ways to soften the 
blow (see “How to take a successful break). 

Few statistics exist on how often, for how 
long or why PhD students take time off from 
their studies. In the United States, neither the 
National Science Foundation nor the Council of 
Graduate Schools tracks leaves of absence or can 
point to a central source for such data. 

Some individual institutions provide 
estimates of how many PhD students have 
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> taken breaks each year. Heather Amos, 
a spokesperson for the University of Brit- 
ish Columbia in Vancouver, Canada, says 
that about 50 PhD students out of nearly 
4,000 across all disciplines, or about 1.25%, took 
a leave of absence in 2015. 

Martin Grund, spokesperson for the Max 
Planck Institutes’ graduate-student organiza- 
tion, PhDnet, says that his group doesn't track 
leaves of absence. 


But, he says, internal “The way 
surveys show that technology 

7% of doctoral stu- is changing, 
dents at the institutes taking a break 
in 2012 were parents, can become 
and so had probably difficult. How 
taken parentalleave do you stay 

at some point. cutting edge?” 


Some funding 
agencies allow for certain interruptions of 
study, including care for children and elderly 
people, professional development and other 
life needs. Some universities permit students 
to retain access to campus services while on 
leave for a variety of reasons; others have no 
defined policy. 

Anecdotally, it seems that few PhD students 
so much as think about a pause in their pro- 
gramme. “I think most don’t even consider it? 
says Heather Buschman, who earned a PhD in 
molecular pathology from the University of 
California, San Diego, School of Medicine after 
taking six months off for a US National Cancer 
Institute communications internship in 2006. 
“They think, ‘I could never do that: People are 
on sucha focused trajectory and see any waver- 
ing as a negative.” 

There is a great deal of external and internal 
pressure to race to the finish, agrees Gareth 
O’Neill, a PhD candidate in linguistics at Leiden 
University in the Netherlands. “Doing a PhD is 
a relatively focused and driven occupation — 
once started, you just want to finish it,” he says. 

As a board member of the PhD Candidates’ 
Network of the Netherlands, O’Neill is involved 
with an initiative called the Professional PhD 
Program, which helps to place PhD students 
who seek work experience outside academia. 
O’Neill says that the programme rarely receives 
applications from students who feel they need 
to stay at the bench throughout their doc- 
toral studies, but that those who do apply 
sometimes experience pressure from 
supervisors to finish their PhD sooner. 
That pressure, he adds, is misguided or 
inapplicable — particularly from mid- 
or late-career scholars, who don't know 
or who don’t want to admit how hard 
it is for new PhD students to remain in 
academia now. “We hope to bring about 
a shift of mindset,” he says. 


HIGHTAIL IT OUT 
Still, when the need for a hiatus arises, some 
don't hesitate to take it — and then sail 
through their leave and back. Earlier this 


year, Anna Miller earned a PhD in parks, rec- 
reation and tourism management from North 
Carolina State University (NCSU) in Raleigh. 
She says that she never questioned her deci- 
sion to leave her research behind for half a 
year, when her Brazilian fiancé was offered a 
postdoctoral appointment in Portugal. The 
travails of a long-distance relationship had 
become burdensome, and she wanted to join 
him abroad. “Academically, I was getting a bit 
burnt out, but it was really the strain on my 
personal life that was the problem,” she says. “If 
I was going to stay in the programme, I needed 
to deal with the personal part of my life.” 

Her adviser was concerned that she might 
not come back, and Miller herself says that 
she left for Portugal knowing that might be 
true. But in Lisbon, she found herself drawn 
to nearby parks, and started studying how they 
were managed, just for fun. 

“Tt was a refreshing way to look at the same 
questions from a different perspective and 
reaffirm my desire to study this subject;’ she 
says. “I came back with new energy for being a 
full-time student” 

Re-entry turned out to be easy. To get 
approval for a leave of absence, Miller and her 
advisers had already agreed on a formal plan 
for her return, charting out how she would later 
complete course work, research and exams. 

They had also predetermined how Miller’s 
funding, which was suspended while she 
was away, would be reinstated. Everything 
unfolded as planned — and Miller became 
treasurer and then co-president of her depart- 
ment’s graduate-student association. She also 
began to mentor other students and to organ- 
ize career panels and other programmes. 

Three years on, Miller and her fiancé have 
since married, and she is nowa 
resident lecturer at the School 
for Field Studies Center for 
Marine Resource Studies in 
the Turks and Caicos Islands. 
She teaches undergraduates 
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who are studying abroad. “I can't think of any 
negatives of taking the time off? she says. 

Others also report a positive experience. 
“Ideally, Id say don't take time off, but if you do, 
don't judge yourself harshly,’ says Jen O’Keefe, 
a geologist and science-education researcher at 
Morehead State University in Kentucky. 

She took a pause from her PhD studies in 
2002, after a working relationship with an 
adviser fell apart. Ultimately, she devised a new 
plan that combined part-time work on her doc- 
torate with a full-time teaching schedule. 

Looking back, she thinks that her research 
career benefited from the five-month break, 
which enabled her to refocus her work towards 
palaeoecology and curriculum and instruc- 
tion, as well as a variety of other pursuits that 
she loves. “Everything from fly-ash geochem- 
istry to honey studies to sinkholes,’ she says. 
“No two PhD situations are the same. You have 
to do what's right for you.” 

Some think that their field of study smoothed 
the way. Benedikt Herwerth, who studies theo- 
retical quantum physics at the Max Planck 
Institute of Quantum Optics in Garching, Ger- 
many, says that he had little trouble setting up 
two stints of paternity leave, for a total of seven 
months, after the birth of his daughter in Febru- 
ary. He says that Germany’s generous approach 
towards parental leave helped, but that his field 
of study might also have facilitated the inter- 
ruption. “I'm not doing experiments,” he says. 
“Tt might be an advantage.” 


PERSONAL BARRIERS 
Even when there are no obstacles to taking 
time off, trouble might arise that complicates 
a student’s return. Eleanor Harding, a doc- 
toral candidate at the Max Planck Institute 
for Human Cognitive and Brain Sciences in 
Leipzig, Germany, has taken two breaks from 
her PhD research on how the brain pro- 
cesses music and language. The first was 
in 2011, when her mother died. Harding 
took several months off. “I lost my edge 
for quite a while,’ she says. “But my adviser 
encouraged me to keep going.” Harding 
returned to work later that year and expected 
to earn her degree in 2013 — until an experi- 
ment fell through, which caused delays, and 
pregnancy complications rendered her 
unable to work. 

In May 2014, after her daughter 
was born, Harding returned to her 
research, but she found that she could 

not afford enough childcare to resume 
her studies full-time. In addition, while she 
had been out, other researchers had pub- 
lished work in her area, so she had to redi- 
rect her research to examine a narrower 
question that would respond to the other 
scientists’ work. “Now, unfortunately, ’'m 
in the middle of the pack instead of at the 
front,’ she says. “I can't say I was the first.” 
A gap of just one year can put a PhD 
candidate behind when it comes to 
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DAMAGE CONTROL 


How to take a successful break 


Even when you know that stepping away 
from your PhD studies makes sense, it can 
create problems, from disruptions in your 
research to unmerited damage to your 
reputation. To finish your programme and 
launch your career, try to take these steps. 
@ Plan, plan, plan PhD candidates who 
successtully take leave say that charting 
out their eventual return was as important 
as departing in the first place. Make 

sure that you understand how your time 
away will affect your funding, teaching 
responsibilities or other roles in your 
department. Plan how you'll continue your 
research once you return, even if you think 
its specific direction could change. 

@ Find your champion Make sure you have 
a person in your corner — ideally, your 
adviser — to stick up for you when you 
encounter resistance. “You have to have 
someone who would say, ‘This person is 
worth carrying for six months, or a year, 
because they bring great value to my 


my 


lab and to the institution,” says Nathan 


mastery of important technological advances, 
warns Kim First, president and chief execu- 
tive of the recruiting firm Agency Worldwide 
in Encino, California. As a headhunter who 
searches for PhD graduates for jobs in biotech- 
nology and pharmaceutical companies, she 
says that she encounters few candidates who 
have interrupted their doctoral programme. 
“The way technology is changing, taking a 
break can become difficult,’ she says. “How 
do you stay cutting edge?” 

Other recruiters say that taking time away 
to have children or for other life events can 
hurt a researcher's scientific reputation, and 
that students should find ways to incorporate 
those obligations into their PhD programme 
without putting their research on pause. 

Some think that the stigma might be worse 
for women. Justin Schwartz, head of the mate- 
rials science and engineering department at 
NCSU, has helped students to organize leaves 
of absence. When it comes to parental leave, he 
says, women are more likely than men to take 
the time off — but those who do are often ter- 
rified (sadly, with some reason, he notes) that 
faculty members will think that they lack the 
drive to be the best and will extrapolate that 
women aren't suited to doing science. 

But whether female or male, most students 
experience one clear consequence after tak- 
ing the break: they lose momentum. Hard- 
ing says that although there was a benefit 
to delaying her dissertation — a compet- 
ing paper helped her to solve a problem in 
her data — she now has few job leads near 


Vanderford, who researches training 
issues in academia at the University of 
Kentucky in Lexington. “You have to have 
an advocate.” Others, including savvy 
administrative staff, can prove key in 

the nuts and bolts of making 
arrangements. 

@ Open your mind One reason that 
students decide not to take a leave of 
absence is because of fear that it will 
damage their chances of getting a 
conventional academic job. Buta little 
experience outside the ivory tower might 
teach you that you would actually be 
happier applying your degree elsewhere 
— in industry, for instance. “All the things | 
really liked the most were the things most 
scientists hated,” says Heather Buschman, 
who earned her PhD from the University of 
California, San Diego. In Buschman’s case, 
having an adviser who had worked as a 
clinician and encouraged her to pursue a 
different direction — communications — 
was crucial, she says. E.B. 


her husband’s medical residency in the 
Netherlands, and attributes that to having 
lost potential publications and chances to 
attend more conferences. 

“Your worth is based on quantitative 
measures like an impact factor,” she says. 
“They want people with publications. Life 
doesn't always cooperate.” Harding is now 
networking locally — getting involved, for 
instance, with a organization in the region 
that funds research into Parkinson's disease. 

O’Keefe wishes that the harsh judgement 
werent there, but says that it seems specific 
to academia. “People feel badly and a lot 
of scientists out there judge them harshly,” 
she says. “There's a lot of, ‘If you had to take 
time off, you're not really good enough to 
finish.” She says that many early-career sci- 
entists she knows who interrupted their PhD 
programmes eschewed academic research in 
the end, and instead, accepted positions in 
industry or teaching. Now in her early 40s 
and a mother, she says that she wouldnt have 
done anything differently, and looks forward 
to expanding her research. 

“I was on the fast track and I was moving 
too fast,’ she says. “A lot of good comes from 
taking a break and reassessing your priorities. 
A year off is sometimes the best thing you 
can do. The big message is, it's OK and you're 
not alone and you can go on to be what you 
want to be.” = 


Eryn Brown is a freelance writer in Los 
Angeles, California. 
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TRADE TALK 
Industrial shift 


As a staff scientist 
at Thermo Fisher 
Scientific in San 
Diego, California, 
Shikha Mishra 
investigates how to 
deliver DNA and 
other materials into 
cells and organisms. 
She explains how 
and why she left academia for a job in 
industrial research. 


Describe your workday yesterday. 

I had run an experiment the night before 
and had a bunch of data. I analysed them for 
about an hour and talked with my manager 
about what I'd seen and tried. Then we 
formulated a plan and I went back into the 
lab and set up another experiment. That was 
pretty standard — very similar to what goes 
on in academia. 


How did you learn about jobs outside 
academia? 

My father was an industrial physicist, and I 
knew that he was happy. But my biggest diffi- 
culty was that my friends and I didn't under- 
stand where we could fit within the biotech 
industry. I didn't know the different sectors: 
manufacturing and quality control, market- 
ing, product management and development. 
The way I learned was buying many, many 
cups of coffee for working people who agreed 
to meet with me. I'd ask: “What do you do? 
How did you find this opportunity?” 


When did you decide on industry? 

Four years into my postdoc at Harvard 
Medical School in Boston, Massachusetts, I 
was happy and my projects were going well 
— but the commitment time-wise was extra- 
ordinary. It was 7 days a week, and there were 
times when it was 14 or 16 hours of work a 
day. I was coming to a transition because I 
was about to get married. I thought: “Life has 
got to fit into my career plans.” 


What do you wish you had done differently in 
your job search? 

I wasn’t thinking past the grant deadline 
or the next publication. I should have had 
a LinkedIn profile before starting my job 
search, because I had to track down old 
friends — contacts whom I may not talk to 
every day, but whod be happy to help me. = 


INTERVIEW BY MONYA BAKER 
This interview has been edited for length and clarity. 
For more, see go.nature.com/2fwrzyg 
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Ua SCIENCE FICTION 


BY KEVIN LAUDERDALE 


cc elcome to Galactic Confedera- 
tion processing. Please state 
your name.” 


The creature behind the counter — they 
used actual counters, even this deep in the 
heart of the Galaxy — looked like a giant 
chartreuse bear with black fins running 
down her spine. Marta assumed the bear 
was a she. The translator certainly produced 
a female voice. The bear wasn’t wearing a 
name tag or anything. 

“Ambassador Marta Rilla-Chen;’ Marta 
replied clearly and directly. “Of the Earth 
Diplomatic Corps.” She was there to present 
her credentials to the Confederation. But 
the ceremony wasn‘ being held in a throne 
room or a hall bedecked with ceremonial 
splendour. She was in an office of some sort, 
whose multiple counters and flickering 
lighting reminded her of the fluorescents in 
the DMV back home. Only the view of the 
swirling Galactic core out of the windows 
and the presence of dozens of different types 
of aliens around her broke the illusion. Still, 
she couldn't help but wonder if some of those 
other aliens in the office were there to pay 
parking tickets. 

“Form, please,” growled the bear in a 
bored monotone, and Marta handed over 
the flexiplast pages. The bear ran her eyes 
down the form. She stamped an item here 
and circled an item there with a laser pen. 
She stopped and looked up from the form. 
With pronounced disapproval she said: “You 
didn't fill in Box 27” 

Marta had been expecting that. “Right,” 
she said. “I wasn't sure what to put down. 
What does ‘Society Code’ mean?” 

The bear sighed. “Every member species 
has a specific number and name.” Marta 
found the droning quality of her rough voice 
actually quite charming. The bear had given 
this speech, what, a thousand times before? 
That’s how many member worlds were in the 
Confederation. “We'll assign your catalogue 
number, but you need to provide the name 
you will all be known by’ 

Marta felt confused. Her face must have 
shown it. 

The bear said: “I can see we'll have to go 
through the whole zarfasnop. Let’s try to 

make this quick. Says 
> NATURE.COM here your planet’s 
Follow Futures: name is Earth. Does 
 @NatureFutures that mean anything 
Ei gonaturecom/mtoodm = in particular?” 
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BOX 27 


A question of definition. 


“Ummm, land, soil, dirt.” 

“Oh yeah. I’ve heard about your planet. 
Earth is the place with all the water, but 
you named it Dirt. Makes a lot of sense. 
How about if you call yourselves The 
Crazies. That’s short at least. Raljevanites call 
themselves We Who Created The Ice-Based 
Food That Is Sweet But Causes Your Head To 
Hurt. That's their planet's claim to fame and 
they’re sticking with it for all it’s worth. If you 
were from Melton II, you would be part of 
The Survivors. That’s nice. Also short. Any- 
way, The Crazies, then?” Her pen hovered 
over the form. 

“Wait!” cried Marta. The Crazies! She 
could just hear her boss exploding over 
that. Forget her boss — the 8 billion people 
of Earth would not be pleased that the rest of 
the Galaxy would know them as The Crazies. 
“Let’s try something else. Ummm, we also 
call ourselves Terrans.” 

“What does that mean?” 

“People who live on Terra... Oh, but that 
just means land” 

“Which is dirt. The Dirties?” 

Marta shook her head. “Oh! Of course! 
We call ourselves People.” Why hadn't she 
thought of that in the first place? That was 
better. That was respectable. 

The bear snorted. “Everyone calls them- 
selves People. My people call themselves 
People. That guy over there from Aquatox IV 
wearing the glass bowl that keeps his head 
immersed in water, calls his fellow creatures 
People. You'll have to try harder” 

Marta thought. “All right. We are... Homo 
sapiens. The Wise Man. And Woman. Tech- 
nically we're Homo sapiens sapiens. So how 
about The Really Wise People?” 


R 2016 


“Didnt your planet almost blow itself up 
with nuclear weapons —” 

“But we didn't.” 

“— and then almost destroy itself through 
overheating?” 

“But we didn't.” Still, she could see the 
bear’s point. Humans had eventually 
mastered atomic power constructively, 
restored the planet's ecology, and made it 
to the stars. Now they were even joining a 
confederation of fellow space-farers. We had 
come far, but perhaps it was stretching it a bit 
to wish to be known as Really Wise. 

“Besides,” said the bear. “The Really Wise 
People is already taken.” 

“By whom?” Marta was indignant. Who 
could be so egotistical as to actually use that? 

The bear gave a wry, growling chuckle. 
“My People.” 

Marta shook her long black hair and 
composed herself. “All right ... oh! The 
Humans!” 

“What does that mean?” 

Marta sighed. “People.” 

“Your species has a real talent for circu- 
lar thought. You want to reconsider The 
Crazies?” 

“No.” Marta closed her eyes and concen- 
trated hard. Humans were indeed crazy, 
dirty, noisy, lovable, brilliant, joyous... 
Marta wasn't getting anywhere. This was 
frustrating. She looked at the bear. “What’s 
your name?” She couldnt keep on thinking 
of this Really Wise Person as a bear. 

“Why do you want to know?” 

“Tt's what humans do. We connect.” 

“Tam called Weylo”” 

“Hi, Weylo.” Marta smiled. “Truth is, we 
humans don't really know who we are. We're 
just beginning to figure that out. That's partly 
why we’re out here. Each of us is a human, 
but we're really at our best when we're con- 
nected. It’s the things we do together that 
makes us Humanity” 

“Humanity?” 

“Yeah, Humanity.” That was it. It meant 
not just all of us, but all of us connected. 

“At least it’s short.” Was that a smile on 
Weylo’s muzzle? She gave the form one final, 
resounding stamp. “Welcome to the Galac- 
tic Confederation, Marta Rilla-Chen... and 
Humanity: = 


Kevin Lauderdale’ fiction has appeared 
in several of Pocket Books’ Star Trek 
anthologies as well as numerous cross-genre 
collections. Visit him at KevinLauderdale. 
Livejournal.com. 
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